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List of abbreviations used in glosses 


first person 
second person 
third person 
animate 
augmentative 
base for enclitic pronoun 
causative 
completive 
demonstrative 
diminutive 
distal 
emphatic 
exclusive 
future 
habitual 
human 
inanimate 
inclusive 
indefinite 


INTJ 
IMP 
IRR 


interjection 
imperative 

irrealis 

discourse particle LA 
locative 

negation 

discourse particle NGA 
participle 

perfect 

plural 

possessive 

potential 
preposition 
progressive 
question particle 
reciprocal 

relative 

singular 

stative 


Orthographic conventions 


Throughout, I use the standard written orthography of ZAI (Alfabeto popular 
para la escritura del zapoteco del Istmo 1956), which generally follows the ortho- 
graphic conventions of Mexican Spanish, for example: 


ch /tf/ 
gand gu /g/ 
hu /w/ 
gú /gw/ 
dx /d3/ 
xh /S/ 
X /3/ 


“Note, however, that x before voiceless consonants is pronounced [J]; often used 
as POSS morpheme. 


Although ZAI is a tonal language, tone is not marked in the ZAI orthography. 

I note the underlying tonal information in the gloss (the superficial tones can 

be straightforwardly derived from the underlying tones — although this requires 

more investigation (Pérez Báez, p.c.) - and use the following notation for tones: 
rising (LH) tone [LH] 


high (H) tone [H] 
low (L) tone unmarked 
Glottalized vowels apostrophe [’] immediately 


after the vowel 

Laryngealized vowels two consecutive vowels, 
[VV] (still within a single 
syllable) 
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11 Motivation and objectives 


Linguists have begun to uncover commonalities across the world's languages 
with respect to the way discourse is organized and cross-linguistic research has 
shown a wide range of typological phenomena associated with different compo- 
nents of information structure (Bernini & Schwartz 2006; Mereu 2009; Erteschik- 
Shir 2007). However, because the great majority of research in this area is done on 
well-documented, non-endangered languages, comprehensive cross-linguistic re- 
search remains difficult. This study aims to conceptualize this interaction in more 
precise ways by presenting the main linguistic strategies by which speakers of 
Isthmus Zapotec, a tonal and verb-initial language spoken in Oaxaca, Mexico, 
convey information. The study of discourse and information structure is scarce 
in tonal and verb-initial languages and extremely lacking for the great major- 
ity of Mesoamerican languages including those in the Otomanguean stock (cf. 
Camacho et al. 2010; Lillehaugen 2008; 2016). 

Isthmus Zapotec (ISO 639 code: ZAI) is a Central Zapotec language of the 
Otomanguean stock spoken by approximately 50,000 speakers in and around the 
region of Juchitán, Oaxaca, Mexico although, increasingly, the language is under 
threat due to a rapid shift to Spanish. Several different attempts at a classifica- 
tion of the Zapotec languages have been made throughout the history of their 
documentation (see Smith-Stark 2003; Campbell 2017b,a for a detailed overview). 
Although no consensus has been reached as to which classification is the most ac- 
curate, it has become clear that the diversity of Zapotec languages is extremely 
rich. Nevertheless, while a considerable amount of work has been done, espe- 
cially in recent years, on the documentation and description of the grammars of 
these languages (e.g. Avelino 2004; Beam de Azcona 2004; Sonnenschein 2005), 
very few studies have been devoted to analyzing naturally-occurring discourse 
and the way these languages are used by speakers in everyday life (cf. Castillo 
Hernández 2014). 

More specifically, I draw on a corpus I collected through 17 months of field- 
work as well as on a relatively large body of existing documentation to present 
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a study of information structure. In this, I generally follow the framework es- 
tablished by Lambrecht (1994) which understands information structure as the 
study of how the different components of sentences - intonation, morphology, 
and syntax - are organized with respect to each other in discourse to signal topic, 
focus, definiteness, and the accessibility of referents. One way to think about in- 
formation structure is in terms of “information packaging’ and by considering 
hypotheses about the receiver's assumptions as crucial to discourse structure 
(Chafe 1994; Lambrecht 1994). These are the sender's hypotheses about the sta- 
tus of the referent of each linguistic expression, as represented in the mind of the 
receiver at the moment of an utterance. Thus, for studies on information struc- 
ture, it is the way the information is transmitted that is critical, rather than the 
lexical or propositional content of a sentence, around which grammar usually 
centers. 

Three main observations motivate this study: 1) the combination of the exist- 
ing documentation and a relatively large and active speaker community offer a 
unique opportunity to document information structure in ZAI and to study the 
language as it is used by speakers in everyday life; 2) as a tonal and verb-initial 
language, the study of ZAI represents a chance to explore the possible combina- 
tions of tone, intonation, morphology, and verb-initial syntax that may occur in 
the coding of information structure, and 3) the analysis of an endangered lan- 
guage contributes to our theoretical understanding of information structure and 
informs our knowledge of language documentation practices and revitalization 
efforts. 

These observations lead to the following four research questions: 


1. What are the different morphological forms that nominal referents in ZAI 
can have and how are these forms used by speakers to express different 
types of cognitive status? 


2. Since constituent order is known to have important discourse functions 
in many languages and since a very small percentage of the world's lan- 
guages are verb-initial, how does verb-initial syntax in ZAI condition the 
ways that speakers formulate their discourse to satisfy their communica- 
tive goals? Are constituent order changes a possible strategy for expressing 
all types of topic and focus constructions or only a subset? To what extent 
do phonetic and intonational cues also play a role? 


3. A discourse particle, LA, is employed often in ZAI discourse. What dis- 
course functions does this particle have? 
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4. What is the distribution of stress and of pauses at the phrase- or discourse- 
level? Are they predictable? How do stresses and pauses interact with the 
tonal system of the language? How do they interact with the expression 
of topic and focus structures? 


I begin by reviewing the main typological characteristics of the language, in- 
cluding the tone system, the structural function of prosody, and constituent or- 
der, and show that the most common arrangement of constituents in ZAl is verb 
followed by subject then object. Verb-initial syntax, however, is often violated as 
the pre-verbal position can be the locus for important discourse functions. The 
pre-verbal position is shown to interact closely with grammatical role and prag- 
matic status of nominals in the expression of topic and focus relations. Through 
the close examination of the form, function, and distribution of ZAI nominals, I 
analyze the different nominal forms used to introduce and track referents and to 
mark referents as more or less accessible. I focus specifically on the distribution 
and alternation of two types of third person pronominal forms, the zero form and 
the overt subject enclitic form, in spontaneous narrative and conversation and 
conclude that an important factor governing their use is the relative thematic 
salience of the referents: the overt enclitic is used for more thematic figures and 
the zero form for less thematic figures. 

I then build on this discussion of nominal forms to address topic and focus 
relations. I find that while sentence focus and predicate focus constructions are 
consistently verb-initial, argument focus constructions may contain either pre- 
verbal constituents (within the clause) or, alternatively, may be verb-initial. No 
evidence is found for pitch accents directly associated with focal material. 

The analysis of topic and focus relations is extended in the latter chapters by 
examining data from narrative and conversational contexts where ZAI speakers 
employ topic and focus constructions for specific interactional purposes. I ex- 
amine a conversational strategy in which ZAI speakers use predicate focus and 
argument focus successively. The combined use of predicate focus and argument 
focus is analyzed as a chiastic structure in which the speaker binds two intona- 
tion units into a couplet to be interpreted together. One effect of this use is to 
extend his/her speaking turn for an additional intonation unit, with the second 
part, the argument focus construction, marking the end of the speaker's turn, 
ceding the floor. 

The work concludes with a detailed look at a multifunctional discourse par- 
ticle, LA. I show that it is used in topic-promoting contexts, as well as to mark 
"scene-setting topics" that have a frame-setting or delimiting function, to indicate 
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changes in topics or boundaries of topical units, and for contrastive topics. 1 con- 
clude that La-marked constructions should be viewed not only as a resource for 
marking various types of topical information, but more generally as a resource 
for organizing talk and interaction. 

Overall, the analysis demonstrates the value of and need for information struc- 
ture studies to document and analyze spontaneous and naturally-occurring dis- 
course, particularly in understudied and endangered languages. The primary goal 
is to extend the analysis of the syntax-pragmatic interface beyond the notions of 
topic and focus to incorporate phenomena that have a function clearly linked to 
the structuring of discourse and interaction. To put it another way, although the 
direct elicitation of topic and focus constructions will be shown to be useful for 
understanding the range of morphological and syntactic combinations available 
to speakers, the close analysis of narrative and conversation offers an oppor- 
tunity to connect information structure phenomena to - and find explanatory 
reasons in - the broader discursive and interactional contexts in which they are 
situated. 


1.2 Ethnographic setting 


ZAl is spoken by approximately 50,000 people in and around Juchitán de Zara- 
goza, in southern Oaxaca, Mexico. The language is under threat due to a rapid 
shift to Spanish which has left towns such as La Ventosa, north of Juchitán, with 
no children actively learning the language (Gabriela Pérez Báez, p.c.). The region 
of Juchitán, Oaxaca was populated by the Zapotecs approximately 200 years be- 
fore Spanish contact, making ZAI one of the latest to diverge from the Central 
branch of the Zapotec language family (Rendón 1995). Today, with the important 
port of Salina Cruz only 30 km south, the city of Juchitán is a small, sprawling 
urban center with 100,000 residents, located on the highway and railroad routes 
that cross the Isthmus of Tehuantepec and create a bridge between the Gulf of 
Mexico and the Pacific Ocean. In a country where the great majority of indige- 
nous languages are associated with small, rural communities, Juchitán is unusual 
because, while it is also home to white and mestizo elites, it has a majority ZAI- 
speaking population which has managed to maintain a very strong indigenous 
identity and culture. This is one reason why the city is home to the first indepen- 
dent indigenous radio station in the country, Radio Teka. 

Still, for almost five centuries, Spanish has served as the language of govern- 
ment, of the formal job market, and of the mainstream media and, increasingly 
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Figure 1.1: A linguistic map of the Isthmus of Tehuantepec (based on 
Paul et al. 2016). Language families represented are Nahuatl (red), Mixe- 


Zoquean (tan), Zapotec (blue), Chontal (green), Huave (orange). 


with each generation, is replacing the indigenous language.! Today, the impact 
of Spanish on ZAI is even stronger than it has ever been, especially since the 
expansion of the public school system and instruction in Spanish about 50 years 
ago. Although the percentage of ZAI-speaking residents older than 50 is quite 
high, the percentage of children that are growing up speaking the language is 
comparatively low, hovering around 507; (Augsburger 2004). So, although stable 
Spanish-ZAI bilingualism has been the norm for several centuries, in many areas 
the language shift from ZAI to Spanish is now occurring very quickly and may 
even complete itself within the next generation (Augsburger 2004). 

Juchitán is distributed geographically into sections and, with the growing pop- 
ulation, the city has extended beyond the original eight sections. In this growth, 
itis increasingly noticeable that the divisions between the sections mark patterns 
of language use such that these patterns roughly correlate with socio-economic 
differences. Although the adult population is overwhelmingly bilingual through- 
out the city, certain sections of the city, like the séptima and cheguigo contain 
the majority of the ZAI-dominant speakers. These sections also contain higher 


"This is true for all or most of the indigenous languages across the country. The complex socio- 
political process that has led to this situation is the subject of Heath (1972). 
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concentrations of people engaged in traditional occupations, such as artisans 
and fishermen. In contrast, sections such as the primera, segunda and tercera are 
Spanish-dominant. These sections are middle-class neighborhoods and contain 
a wider range of occupations.? 

One significant outcome, then, of the increasing rate of shift of the younger 
generation in favor of Spanish is that the range of use of ZAI is being gradually 
reduced to specific sections of the city as well as to certain social networks with 
specific socioeconomic characteristics. The reduction in the range of social situa- 
tions and communicative contexts in which ZAl is employed will no doubt have 
a strong impact on the diversity of genres and styles in which it will come to 
be used in day-to-day life and, concomitantly, on the forms and functions of the 
spoken language itself. 


1.3 Previous work on the language 


The linguist Velma Pickett is responsible for a great majority of the early lin- 
guistic documentation and analysis of ZAI. Beginning her work on ZAI in the 
1950's, much of Pickett's work in those years culminated in her doctoral the- 
sis entitled The grammatical hierarchy of Isthmus Zapotec (Pickett 1960), which 
focused primarily on a syntactic analysis of the language from the perspective 
of tagmemic grammar developed by Kenneth Pike. Pickett continued her work 
on ZAI and, with the establishment of the orthographic conventions, created a 
dictionary (Pickett 1979) and, with Cheryl Black and Vicente Marcial Cerqueda, 
developed a concise speaker grammar (Pickett et al. 1998). The dictionary and 
grammar together give an accurate, though very general, picture of the major 
aspects of the ZAI lexicon, phonology, morphology and syntax. Following Pick- 
ett's work, in the 1980's Carol Mock published several very thorough articles 
on the lexical phonology of ZAI (Mock 1983; 1985a,b; 1988). At around the same 
time, Pickett co-authored an article with Stephen Marlett entitled “The syllable 
structure and aspect morphology of Isthmus Zapotec” (Marlett & Pickett 1987), 
which offers a very good description of the ZAI syllable and the complex system 
of aspectual prefixes. 


?See Saynes-Vásquez (2002), Augsburger (2004), and McComsey (2015: Chapter 1) for a more 
detailed description of the socio-linguistic make-up of the city with respect to its sections. 
In towns such as Xadani and San Blas, which border the main urban areas of Juchitán and 
Tehuantepec, respectively, and supply them with much of the manual labor, the percentages 
of residents older than 50 and of children between five and nine years who speak (or, at least, 
report speaking) ZAl are significantly higher. In other Isthmus towns as well as in Tehuantepec, 
the governmental center of the Isthmus, these percentages are much lower. See also Toledo 
Bustamante (2018). 


1.4 Methods 


To my knowledge, only one documentation project of ZAI has been under- 
taken since the work of Pickett. This was done as part of the Project for the 
Documentation of the Languages of Meso-America (PDLMA). This project is on- 
going and is primarily dedicated to the building of a lexicon (Kaufman et al. n.d.). 
Neither the documentation of prosody at the phrase or discourse level nor the 
documentation of information structure are part of that project. 

Therefore, no studies on narrative discourse or information structure in ZAI 
have been published or even conducted. Moreover, studies on discourse are ex- 
tremely lacking for the great majority of Zapotec languages as well. One signifi- 
cant exception to this is the work by Mark Sicoli (Sicoli 2007; 2010) on the use of 
tone and intonation in Lachixío Zapotec (an Eastern Zapotecan language). Other 
existing work on Zapotec discourse has been done by linguists affiliated with the 
Summer Institute of Linguistics (SIL) (Persons 1979; Long 1985; Benton 1987; 1997; 
Kreikebaum 1987; Riggs 1987; Ward 1987; Piper 1995; Heise 2003; Riggs & Marlett 
2010). These studies have primarily descriptive goals, they tend to focus on folk 
and written narrative, and are concerned mostly with specific syntactic problems 
and analyses at the sentence or paragraph level. Virtually no attention is paid to 
the role of intonation or to the major components of information structure. 

Because of these studies and because of the amount of knowledge already 
gained in the areas of phonology, morphology, lexicon, and syntax, the opportu- 
nity to document and analyze information structure in ZAl is open. The present 
project looks to build on this wealth of previous work. The close study of ZAI 
offers a unique opportunity to explore the possible combinations of prosody, 
morphology and verb-initial syntax that may occur in the coding of information 
structure. Establishing the correlations between these areas is best determined 
by the analysis of spontaneous discourse. At the same time, however, one of the 
most straightforward ways to determine the range of possible constructions is 
via elicitation since this methodology makes it possible to create unambiguous 
contexts which trigger clearly distinct topic and focus structures. In this study, 
I take both methodological approaches. The rationale for utilizing this combina- 
tion of methodologies is discussed in the next section. 


1.4 Methods 


In collecting the corpus that is the basis for this study, I worked with bilingual 
ZAI-Spanish language consultants in Juchitán over a 17-month period to record, 
transcribe, annotate and translate spontaneous speech and collect elicited na- 
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tive speaker judgments of constructed examples. The description that follows of 
information structure of the language fills a crucial gap in the empirical base 
of knowledge about ZAI as well as Zapotec languages more broadly, and con- 
tributes important data for more general theoretical questions about language 
structure and use. 


1.4.1 Corpus creation 


During the fieldwork stage, 1 recorded spontaneous speech and supplemented 
this with data from elicitation through traditional field methodologies. The col- 
lected recordings ensure that naturally-occurring speech forms have been doc- 
umented while the elicitation sessions ensure that these forms are considered 
with respect to a broader set of possible combinations of tone types, intonation 
patterns and constituent orders. In the end, the documentary corpus allows for 
a more complete understanding of the range of constructions that are available 
to ZAI speakers and how they are employed to respond to specific discourse 
motivations. 

In this, the project adopts a "discourse-centered approach" for documentation 
and description (Sherzer 1987). Focusing on naturally-occurring speech makes 
it possible to find and analyze words and structures that may not surface when 
sentences from the contact language are translated into the target language. 

There are several reasons for focusing this documentation project on spon- 
taneous speech. First, in contrast to other types of spoken genres such as rit- 
ual speech or traditional folklore which often tend to be formulaic, spontaneous 
speech and dialogue have the advantage of being naturally-occurring while pro- 
viding extensive information about information structure. Second, it offers the 
possibility of simultaneously documenting popular oral histories. Third, sponta- 
neous speech is cross-linguistically under-documented. Fourth, the long schol- 
arly tradition and extensive analysis of conversation across disciplines in the 
social sciences and humanities offers a solid foundation upon which linguistic 
analyses can be carried out as well as a potentially fruitful avenue to pursue in 
the dissemination of the data. In the end, by focusing on spontaneous speech, 
this project underlines the importance of documenting a speech genre that is 
meaningfully embedded in the daily social lives of the speakers. 

Still, it is important to recognize that specific constructions, word or intona- 
tion contours of interest might occur only very rarely in running speech, which 
makes it impractical to rely solely on free narrative and/or conversation for lin- 
guistic research of pre-determined phenomena. This is the point made by Him- 
melmann (2006), specifically with respect to the documentation of prosody, which 
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a part of this project will be particularly concerned with. To this end, structured 
games and nonlinguistic triggers such as pictures and short video clips, were 
employed in elicitation sessions designed to document a range of intonational 
contours and constituent orders. 

As noted above, Zapotecan languages are well known for being phonologically 
complex, containing complex interactions between tone, stress, and voice quality 
modifications such as glottalization and larygealization. The documentation of 
ZAI discourse represents a chance to document the interesting phonological and 
phonetic variations of the language in use and the annotation and analysis of 
prosodic phenomena form a central part of this project.? 


1.4.2 A discourse corpus 


The collection of material for the discourse corpus employed native speakers 
of ZAI as language consultants and used the following data collection methods: 
1) audio and audiovisual recording of naturally occurring speech, and 2) tran- 
scription and analysis of the data. The main purpose was to begin a collection 
of recordings with samples of spontaneous speech, something not represented 
currently in any archives of the language. 

The language is undergoing shift, so it was important to responsibly archive 
the data for future researchers and community members. Because of the hot and 
humid climate and because the majority of recordings were done outdoors, I used 
a Zoom H4n recorder and a Sony ECM-MS 957 external microphone as well as 
lavalier microphones. Audio recordings were made at a sampling rate of 16 bit/44 
Khz. Visual recordings were made using a digital video camcorder with the same 
external microphones. All recordings were digitized and converted into WAV, 
MPEG1 and MPEG2 files to conform to Open Language Archives Community 
(OLAC) standards. 

In-field processing of the data included the transcription, translation and anno- 
tation ofthe recordings with the help of native-speaker language consultants (but 
not the speakers themselves). The texts were represented and time-aligned to the 
primary data using ELAN software in a multi-tiered analysis: orthography using 
the Isthmus Zapotec conventions; a morpheme-by-morpheme tier with glosses 
in Spanish and English using transparent terminology; and free translations in 
both Spanish and English. All phonetic analysis was done using Praat. 

Metadata for each recording is provided based on the International Standards 
for Language Engineering Metadata Initiative (IMDI) so as to ensure that all the 


? After all, not marking prosody in transcription may result in *making something perfectly 
determined in speech undetermined in transcription" (Scarano 2009: 57). 
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relevant metadata is systematically and transparently documented. The audio 
and video recordings have been archived at the Endangered Languages Archive 
Repository (ELAR) of the Endangered Languages Documentation Programme at 
the School of Oriental and African Studies of the University of London. They are 
accompanied by transcriptions of the data and metadata files with information 
for each recording, all done in XML format. 

The benefits of utilizing these standard documentation practices are twofold: 
they facilitate the proper archiving of the materials and the wider use of the 
resources by other people, including the community itself and they also facilitate 
future analyses by allowing for searches across structured annotations. 


1.5 Organization of the study 


This chapter discusses the motivation and objectives of the project and presents 
background information on Isthmus Zapotec and the speech community that 
is the subject of the research. It briefly describes the Isthmus Zapotec speaking 
population and characterizes the language's endangered status along with the 
socio-historical and cultural factors that shape the current linguistic situation. It 
surveys the existing documentation for ZAI, showing how the documentation 
of discourse aims to fill an important gap in the current documentation of the 
language. The chapter concludes with a review of the methodology employed in 
the data collection and creation of the corpus. 

The following chapter presents a grammatical sketch of ZAI. It addresses the 
most relevant typological characteristics of the language, including, the phono- 
logical system, the structural function of prosody, and verb-initial syntax, focus- 
ing specifically on the role of constituent order in the expression of information 
structure in ZAI and showing the pre-verbal position to be the locus for a va- 
riety of discourse functions. It concludes with a summary of the main research 
questions that guide the rest of the study. 

The main objective of Chapter 3 is to explore the relationship between, first, 
the form and distribution of nominals and, second, their function in discourse 
to introduce and track referents and to mark referents as more or less accessible. 
This discussion is framed in terms of the combined lens of Preferred Argument 
Structure and Accessibility theory. It then moves on to a discussion of the cog- 
nitive status of the various nominal forms available to ZAI speakers. Chapter 4 
focuses specifically on the contrast between the overt 3sc subject enclitic and a 
zero form. It explores the distribution and alternation of the two third person cli- 
tics in narrative and conversation and argues that an important factor governing 
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the use of these forms is the relative thematic salience of third-person referents. 

The goal of Chapter 5 is to analyze the focus structures available in ZAI. It 
does so by presenting a survey of the main focus marking constructions of sen- 
tence focus, predicate focus, and argument focus (Lambrecht 1994) in order to 
place ZAI information structure within the typology of focus structure proposed 
by Van Valin (1999). The chapter explores the extent to which ZAI may be con- 
sidered a more or less "rigid" verb-initial language with respect to the kinds of 
pragmatically-marked information that may appear in pre-verbal position. The 
chapter ends with the consideration of a parallel use of sequenced predicate focus 
and argument focus constructions in conversation. 

Chapter 6 extends the analysis and the observations made in previous chapters 
to provide an analysis of the main topic marking strategies in ZAL, including 
presentational, topic-comment, and identificational constructions. The chapter 
ends with a discussion of the particle LA and its functions in conversation to 
mark pre-posed adverbial clauses and left-detached contrastive topics and, more 
generally, to negotiate and secure common ground between interlocutors. 

The final chapter summarizes the main conclusions of the study and proposes 
avenues of further research. 
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2 Background: the basic grammatical 
structures of ZAI 


This chapter presents a short description of the main typological characteristics 
of the language summarizing the aspects of ZAI grammar that are most relevant 
to the analysis of information structure. This description lays a foundation on 
which to explore the interrelationships between nominal forms, constituent or- 
ders, particles, and prosodic patterns. The chapter begins with a description of 
the segmental and tonal inventory and a brief explanation of the orthographic 
conventions used throughout. It then builds on an analysis of the ZAI tonal sys- 
tem in order to discuss the basic prosodic properties of the language at the phrase 
and discourse level, in particular the structural function of stress and pauses. The 
chapter then continues with an overview of ZAI verbal forms and basic clause 
structure. This leads into an examination of the main constituent orders in ZAI 
and concludes with a closer inspection of the pre-verbal position. 


2.1 The segmental and tonal inventory 


In this section, I offer a brief sketch of the segmental inventory and phonolog- 
ical system of ZAI. The information presented in this section is important for 
understanding the prosodic and verbal structures discussed in the remainder of 
the chapter. 


2.1.1 ZAI segmental inventory 


ZAI contains the segment inventory shown in Tables 2.1 and 2.2. 

The relevant contrast between consonants with the same place of articulation 
has traditionally been referred to as a fortis-lenis contrast (Pickett 1960, Pickett et 
al. 1998; see also Arellanes 2009, Chávez Peón 2010 with respect to other Zapotec 
languages).! This fortis-lenis contrast parallels the voiced-voiceless distinction, 


‘This contrast has also been referred to as a morpho-phonological contrast between simple and 
geminate consonants (Swadesh 1947). 


2 Background: the basic grammatical structures of ZAI 


Table 2.1: ZAI consonant inventory 


p t tf k 
b d d; g 
f* S f h 
Z 3 
m n n 
n: 
r* 
f 
1 
1: 
w y 


(* = Appear only in loanwords) 


where the lenis consonants are the voiced consonants and the fortis consonants 
are the voiceless consonants. 

The five modal vowels all have glottalized and laryngealised counterparts (see 
Table 2.2). 


Table 2.2: ZAI vowel inventory 


i i? i'i u u? ufu 
e e? ele o o? olo 


a a? ala 


(Modal, laryngealized, and glottalized vowels) 


Glottalization is realized as a post-vocalic glottal stop in a stressed monosyl- 
labic root (1a) (the prefix ri is a habitual marker) and, if the root is disyllabic, also 
simultaneously as a word-final glottal stop in pre-pause position (1b). 


(1) a. ri-nda' [rindà?] ‘stinks’ (cf. ri-ndá [rindá] “arrive”) 


b. bé'üe'[bé?pe?] ‘alligator’ (cf. beñe [bene] ‘mud’) 
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2.1 The segmental and tonal inventory 


Laryngealization is realized as creaky vowel quality and a double pulse to the 


syllable (2a,b). 


(2) a saa [sà/a] ‘music’ 
b. na-dxübi [nà-dzi'ibi] ‘fearful’ 


Glottalization and laryngealization each interact closely with stress in ways that 
are discussed in more detail in 82.2.1. 
2.1.2 The tonal system 


There are three phonemic tones: high (H), rising (LH), and low (L). These tones, as 
they appear on monosyllabic and disyllabic morphemes, are shown in Table 2.3.? 


Table 2.3: ZAI tonal inventory 


Monosyllabic  Disyllabic 


H dxé léxu 
[d3é] [lé:xú] 
“boy” ‘rabbit’ 
LH dxi yúzé 
[dsi] [yŭ:zě] 
‘quiet’ ‘livestock’ 
L ru benda 
[ru:] [ben:da:] 
“cough' ‘fish’ 


Importantly, morphemes which contain a rising (LH) tone on the final sylla- 
ble carry a floating H tone. The floating H tone appears on the final syllable of 
these words in isolation, but floats onto the following syllable utterance-medially. 
Two examples of words uttered in isolation are given in Table 2.4, along with an 
example of these used in a phrase in which the first word now appears utterance- 
medially. 

Whereas the word né is pronounced with a H tone in isolation, when used 
utterance-medially, the floating H tone appears on a following L tone syllable 
causing the word dubá to be pronounced dúba. 


?One additional attested tonal pattern not shown here, LH L, is found only in loanwords, e.g. 
male “compadre”, óra ‘hour’. 
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2 Background: the basic grammatical structures of ZAI 


Table 2.4: Morphemes with floating H tone 


Monosyllabic  Disyllabic 


né dubá 
[né:] [du:ba:] 
LH LLH 
‘and’ “maguey” 


Used utterance-medially 


ne dábá 
“and maguey” 


Finally, it is important to note that the various surface tone types are not all 
manifested with equal regularity. Pickett's Vocabulario (Pickett 1979) reports a 
frequency of 6% for words that contain a syllable with a high (H) tone, 22% for 
words that contain a rising (LH) tone, and 17% that contain a floating H tone. 
Words containing only low (L) tone syllables are the most common, comprising 
about 55% of the lexical inventory. In the next section, I explore the place of the 
ZAI tonal system within the broader prosodic system of the language. 


2.2 The structural function of prosody in ZAI 


This section is concerned with the structural function of prosody in ZAI, that is, 
with the role of prosody in the segmentation of the speech signal into groups 
of words. In what follows, I first present a more detailed account of the ZAI 
phonological system than that provided in 82.1 by offering a summary of the in- 
terrelationships between tone, laryngealization, glottalization, and stress. After 
a short review of the existing literature on the structural function of prosody in 
other Zapotec languages, I then explore some of the ways that tone, laryngealiza- 
tion, glottalization, and stress interact within the ZAI prosodic system. Finally, I 
touch briefly on the role of prosody in the marking of information structure, a 
discussion that will be taken up again in more detail in $5. 
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2.2 The structural function of prosody in ZAI 


2.2.1 Tones, VOMs and stress 


Morphemes in ZAI may be either monosyllabic or disyllabic. As was shown 
above, ZAI has three phonemic tones: high (H), rising (LH), and low (L), as well 
as two voice quality modifications (VOMs), laryngealization and glottalization, 
that may participate in lexical contrasts. 

In addition, stress, although not lexically contrastive, also plays a key role in 
ZAI phonology. As a rule, there is only one stressed, double-moraic segment 
within each phonological word. In disyllabic words, stress falls on the initial 
syllable. Stressed syllables generally contain long vowels. There are two cases, 
however, in which the characteristically long, stressed vowel does not occur: 1) 
if the post-tonic syllable begins with a voiceless obstruent, a nasal, a liquid or 
a glide which undergoes gemination (geminates are not contrastive in ZAI), as 
in the di-syllabic words mili [mil:i:] ‘mullet’ and chupá [chup:á:] ‘two’; or 2) if 
the morpheme is glottalized, as in the disyllabic word bé’fie’ [bé?ñe?] ‘alligator’, 
in which case stress is heard only as heightened intensity and raised pitch regis- 
ter. In short, when stressed, the ZAI syllable nucleus may either be a long vowel 
(V:), a vowel plus a lengthened consonant (VC:), a laryngealized vowel (VV), or 
a glottalized vowel (V^). Clitics do not bear stress and maintain a CV structure. 

Table 2.5 summarizes the interactions between tones, laryngealization, glottal- 
ization, and stress in stressed monosyllabic and disyllabic morphemes (for words 
uttered in isolation). 

If a morpheme is stressed, stress falls on the initial syllable. Duration is the 
primary phonetic indicator of stress as the stressed syllable must be heavy: ei- 
ther the vocalic nucleus is long or the post-tonic consonant is fortis (a geminate) 
leaving the vocalic nucleus short. Pre-pause syllables are also long. 

However, three additional observations are important to note. First, when we 
compare morphemes in stressed and unstressed contexts, we see that the short- 
ened syllables in unstressed and utterance-medial positions carry fewer tones. 
In particular, LH contour tones only arise on long syllables, i.e. on syllables that 
are either stressed or before a pause. When unstressed, the syllable nucleus is 
only a single vowel and the contour tones are 'simplified' to a level H tone. This 
strongly suggests that the mora is the tone-bearing unit (TBU) and that the most 
appropriate representation is most likely one in which contours are composed 
of a sequence of level H and L tones linked to the mora. Second, the data also 
suggest that the L tone is the more unmarked of the two tones. In addition to 
being the most distributionally unrestricted tone, L is also always the one that is 


17 


2 Background: the basic grammatical structures of ZAI 


Table 2.5: Tone, laryngealization and glottalization (in words uttered 
in isolation) (underline notes the stressed syllable in disyllabic roots). 


plain glottalized laryngealized 
H dxé: lé:xu: ri-ndá' | na-yaná' 
tone H HL LH LLH 
"boy “rabbit” “gets hot’ ‘hot/spicy’ 
na-ya’ni’ 
LLH 
‘clear’ 
LH dxi: yú:zé: ri-ndá' niu nadxiibi: 
tone LH LH LH LLH LH LLH LH 
“quiet” ‘livestock’ “gets bitter’ ‘there is’ "fearful 
LH né: du:bá: búu  ridxiichi: 
tone LH LLH LH LLLH 
“and” “maguey” ‘charcoal’ ‘be angry’ 
L ru: ben:da: rinda?  na-ya'qui” chii ^ nadxiibi’ 
tone L LL LL LLL L LLL 
‘cough’ fish’ ‘stinks’ ‘burnt’ ‘ten’ ‘smooth’ 


deleted in contour tone ‘simplification’. 


Furthermore, this raises an important question about the relationship between 
the realization of contour tone and the structuring function of prosody in ZAI 
discourse: if contour tones in ZAI only occur on stressed syllables and before a 
pause, what is the distribution of stress and of pauses at the phrase- or discourse- 
level? Are they predictable? These questions are addressed in the following sec- 
tions. First, I briefly review previous studies on Zapotec prosody. 


3Stress and tone have been argued to be closely interrelated in a number of languages (for 

general discussion, see Yip 2002; Zhang 2002). In particular, pitch movement has been shown 
to be more common under stress (Zhang 2002; Zoll 2003). This is also true in ZAI as contour 
tones are shown to commonly surface on stressed syllables. An additional manifestation of 
this is that stressed L tones have a phonetically falling pitch whereas unstressed syllables with 
L tone are phonetically level tones. 
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2.2 The structural function of prosody in ZAI 


2.2.2 Previous studies on Zapotec prosody 


To my knowledge, the only extensive study that has been done on phrase-level 
prosody in a Zapotecan language has been the work of Mark Sicoli (2007; 2010). 
In his PhD dissertation, A linguistic ethnography of tone and voice in a Zapotec 
region, Sicoli devotes two chapters to an analysis of prosody in Lachixío Zapotec 
(Eastern Zapotec) at both the word level and the phrase level. Although Lachixío 
Zapotec and ZAl are only distantly related, it is not surprising that many of 
Sicoli's observations with respect to prosodic structure hold for ZAI as well. 

He describes Lachixío Zapotec as a “stress-timed” language where there is only 
primary (no secondary) stress which is non-iterative, that is, has at most one 
stress foot. In addition, Sicoli notes that emphasis is marked by a geminate medial 
consonant or stressed vowel of the primary stress foot and that this can serve 
focus functions by marking the edge of a phrase. 

Based on these observations, Sicoli goes on to analyze the intonational system 
as composed of four nested levels: the phonological word, the metrical foot, the 
intermediate phrase, and the intonation phrase. The maximal phonological word 
is composed of a clitic phrase with the following structure: [[proclitic [stressed 
root]] enclitic]. The metrical foot, the unit counted for rhythm, is trochaic. The 
intermediate phrase, a unit between the intonation phrase and the phonological 
word, is defined by phonetic cues such as phrase-final, non-phonemic length- 
ening. The intonation phrase is defined prosodically by the structure of bound- 
ary tones (phrase-final intonation patterns) and by optional cues, such as pause, 
breath, and non-phonemic lengthening of phrase-final vowels. 

Aside from boundary tones, such as a L boundary tone that marks the ends 
of speakers' turns and a H boundary tone that indicates non-finality, two factors 
show that phonological phrasing can have morphosyntactic functions in Zapotec 
speech: 1) case is unmarked morphologically; and 2) body part nouns may com- 
bine with other nouns to form locational expressions (Sicoli 2007: 132). 

Sicoli provides an illustrative example of the second one of these factors. In 
Lachixío Zapotec intermediate phrases help to distinguish between NPs that are 
grouped together as phonological phrases and those that form separate phono- 
logical phrases; this is most clearly seen in the use of body part nouns in “quasi- 
prepositional” phrases (2007: 133).* For example, the two-noun phrase lattsa nikko 
(lit. chest + dog) can be either a possessive construction meaning ‘the chest of a 
dog’ or a locational construction meaning ‘the side of a dog’ (2007: 134). In the 
possessive structure, the H final intermediate phrase tone is placed at the end of 


“For more work on body part nouns in Zapotec see e.g. MacLaury (1989); Lillehaugen (2006). 
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2 Background: the basic grammatical structures of ZAI 


the first word (the possessed object), grouping these words as two phonological 
phrases [[lattsa:][nikko]]. For the locational reading, the second word receives 
a H final phrase tone that groups these words as a single phonological phrase 
[lattsa níkko], thus indicating a prepositional use.? Compensatory lengthening 
provides another phonetic cue. 


2.2.3 Prosodic properties of intonation units in ZAI 


Otomanguean languages have long engaged researchers in the study of the pho- 
netic realization and phonological complexity of stress, tone and vowel phona- 
tion (Arellanes 2009; Avelino 2004; Chávez Peón 2010; Mock 1988; inter alia). 
With the objective of understanding in detail the interaction between stress, tone 
and vowel phonation at the word or root level, the main sources of data for these 
studies have been words and phrases elicited in isolation. This section comple- 
ments this growing body of work by presenting a preliminary analysis of the 
sound patterns in intonation units in ZAI, using naturally-occurring data as evi- 
dence. 

To recap, ZAI has conserved a CV(CV) structure at the root level. Vowels bear 
one of three tones - low (the most frequent), high, and rising - and have three 
phonation types - modal, glottalized and laryngealized. At the root and word 
level, stress is assigned predictably to the first syllable of the root. The vowel of 
the stressed syllable is short when the following consonant is fortis, and long 
when the following consonant is lenis. Various types of extrametrical units can 
attach to a root, including tense, aspect and mood prefixes as well as pronomi- 
nal enclitics, yet stress assignment remains dependent on the root structure. In 
discourse, however, stress and vowel phonation may undergo lenition under cer- 
tain circumstances. It is this process and the resulting patterns that are the focus 
here. 

In this section, as in the remainder of the study, I use the "intonation unit” 
(IU) (Chafe 1994) as the basis for transcription and analysis. The reason for this is 
that IUs have been shown to operate as a fundamental unit of cognitive process- 
ing, social interaction, and other domains (Chafe 1994; Du Bois et al. 1993; inter 
alia). To recognize boundaries between IUs, I follow Du Bois et al. (1993: 100) in 
identifying five major perceptual and acoustic cues: (1) a coherent or unified into- 


?Sicoli also takes this as evidence for the existence of intermediate phrase tones as opposed to 
intonational pitch accents since they occur at the end of the phrase on an unstressed syllable. 
Mock (1988: 204), in her analysis of ZAI phonology, in fact uses a similar example as evidence 
that ^words in ZAI need not receive stress since stress ultimately occurs for discourse-related 
reasons.’ She does not, however, elaborate on this point. 
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nation contour; (2) a resetting of the baseline pitch level at the beginning of the 
unit (pitch reset); (3) a pause between two units; (4) a sequence of accelerated 
syllables at the beginning of the unit (anacrusis); and (5) a prosodic lengthen- 
ing of the syllables at the end of the unit.* This last cue, IU-final lengthening, is 
especially relevant for ZAI: the delimitation of IUs in ZAI is aided by the fact 
that glottalized and laryngealized vowels in IU-final position are immune to the 
lenition process. 

Chafe (1994) distinguishes between three types of IUs: 1) substantive, 2) regu- 
latory and 3) fragmentary. The analysis that follows will focus on the prosodic 
properties that can be observed in substantive IUS, that is, IUs that convey ideas 
about events, states, or referents that participate in the communication of propo- 
sitional content. The data in my corpus shows that, in substantive IUs, stress — 
whose main phonetic correlate I assume to be duration - resides in the last root 
of each constituent in a clause and lenites in all other elements towards the left. 

Consider the brief sequence of substantive IUs in (3). The first line shows the 
superficial phonetic representation and the second line shows the morpheme-by- 
morpheme underlying representation. 


(3 01 racá gidáa nisa luní 
raca! guilH-daa nisa lu-nil 


then iMP-empty water face-3sc 
"Ihen empty water in it; 

02 guiába chupa  chóna  xúba luni lá 
guilH-yaba chupa! chonnal! xuba’ lu=nilH laH 


IMP-fall two three corn face-3sG LA 


“Add a few kernels of corn to it; 


Stress is realized on the first syllable of the last root of each main verb and 
each argument NP. In the first line, stress falls on the verb root -daa ‘to empty’. 
This is observed in the rearticulated vowel, which is fully realized. Stress also 
falls on the first syllable of nisa “water”, which contains a modal vowel that is 
short, followed by a lengthened fortis consonant. The body-part term lu face”, as 
head of the locative phrase, also receives stress and the modal vowel is therefore 
long. In the second line, stress falls again on the first syllable of the verb root, 
-yaba “fall”, and on the first syllable of xuba’ ‘corn’. These two words also contain 
long modal vowels. 


ĉIt is important to note that the presence of any of these is neither a sufficient nor a necessary 
condition, as many may occur for reasons other than an IU boundary and some may be difficult 
to identify under certain conditions. 
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Other words, such as connectives (e.g. racá “then” in line 1 and modifiers (e.g. 
chupa chonna “a few” (lit. “two, three”) in line 2 are not stressed. Because stress 
does not fall on modifiers, the fortis consonants following the modal vowels ([p] 
in chupa and [nn] in chonna) are not fully lengthened. This can be seen if we 
compare them to the fortis consonant in nisa, in line 1, which does receive stress 
and is thereby considerably longer (146ms for /s/ in nisa vs. 84ms for /p/ in chupa 
and 75ms for /n/ in chonna). Note also that the modal vowel of the unstressed 
pronominal clitic =ni “3sc’ is lengthened in IU-final position, 151ms in line 1, but 
is short otherwise, 59ms in line 2. Similarly, =ni carries an underlying rising tone 
with a floating H and is pronounced with a rising tone in line 1 when lengthened 
in IU-final position, but is pronounced with a low tone when short in line 2 (and 
the H tone floats to the following syllable). 

What emerges from an analysis ofIU sequences such as that in (3), is that stress 
in ZAl is predictable at the word or root level and is likewise predictable within 
substantive IUs. The relevant generalization can be stated in terms of syntactic 
constituency: the last root of each VP or NP constituent receives stress and stress 
lenites in all other elements to the left. 


2.2.4 Prosody in ZAI information structure: some initial remarks 


In the previous sections, I briefly described the phonology of ZAI, including its 
tonal system, with high, rising and low contrastive tones. As was seen, this tonal 
system interacts in complex ways with vowel phonation and a fortis-lenis dis- 
tinction in consonants. In addition, I observed that stress operates at the phrase 
level, concluding that the last root of each VP or NP constituent receives stress 
and that stress lenites in all other elements to the left. 

This basic understanding of the phonological system of ZAI will make it possi- 
ble in Chapter 5 to investigate the contribution of prosody to information struc- 
ture in ZAI. There, I will take up the question of whether topic and focus con- 
stituents have a constant prosodic realization and whether stresses and pauses 
are involved in the realization of topic and focus structures. Since one common 
strategy in languages to communicate the status of a referent as new or focused is 
via pitch accent, one goal in that chapter will be to determine whether this strat- 
egy is available in ZAI as well. We will see, however, that the extent to which 
phonetic and intonational cues play a role in the expression of information struc- 
ture in ZAI is minimal and that information structural categories and relations 
are expressed mainly through the manipulation of constituent order. 


22 


2.3 Clause structure and constituent order correlations in ZAI 


In the next section, I move on to a review of verb and clause structure and of 
constituent order correlations in ZAI. This will complete the brief description of 
the typological characteristics of the language that will set the foundation for the 
analysis in the remainder of the study. 


2.3 Clause structure and constituent order correlations in 
ZAI 


This section begins with a review of basic verbal morphology. It then addresses 
the question of constituent order correlations in ZAI to determine whether the 
language exhibits tendencies that correlate with V-O order rather than with O-V 
order, as has been claimed for most, if not all, Zapotec languages. I conclude the 
section, and the chapter, by examining the role that constituent order may play 
in the expression of information structure and present data that identifies the 
pre-verbal position as the locus for a variety of discourse functions, including 
the expression of topic and focus relations. 


2.3.1 Verbal morphology 


Like most verb-initial languages, ZAI employs verbal prefixes. Verbs obligato- 
rily inflect for tense-aspect-mood (TAM). In addition to TAM, verbs also inflect 
optionally for causative.’ Also, if the subject is not a full NP, the verb can be 
followed by a subject pronominal clitic. The basic order of the morphemes in the 
ZAI verb can be represented as follows: 

ASPECT-(CAUSATIVE)-root-(MODIFIER)-(SUBJECT CLITIC) 

Verb roots may belong to one of four verb classes, based on the aspectual 
prefixes they can combine with. Detailed studies of the morphophonemics of 
ZAI verb classes are provided in Marlett & Pickett (1987), Enríquez Licón (2008), 
and Pérez Báez (2015). 


"Overall there is a tendency for suffixes to be associated with OV languages and prefixes with 
VO languages. However, this is only a unidirectional correlation: if all affixes in the language 
are suffixes, the language is more likely to be OV. This correlation is not a strong one, and 
prefixes in OV languages are not at all rare. In other words, we can say that OV languages 
more commonly have suffixes, but we cannot say that VO languages more commonly have 
prefixes (Dryer 2007). 

*For other foundational work on Zapotecan verb classes, see Smith-Stark (2002) and Campbell 
(2011). 
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2 Background: the basic grammatical structures of ZAI 


A few additional comments are in order with respect to the TAM prefix.? Ta- 
ble 2.6 provides a list of the eight aspectual prefixes found in ZAl as well as a 
short summary of some of the observations made by previous scholars. 


Table 2.6: ZAI Tense-Aspect-Mood system 


Prefix TAM Description/Example 


ri-, ru- Habitual Used for habitual or repeated actions in past or 
present, but never future 


bi-, gu- Completive For finished actions, typically in past but not 
necessarily (e.g. future perfect) 


ca-, cu- Progressive For continuing actions in past, present or future 
but may be temporal when used for future 


Za-, Zu- Future For future actions not yet begun, certain 


ni-, nu-, ñ-  Irrealis For something that is contrary to fact; for 
something that did not happen 


gui-, gu- Potential Future action in relation to the time indicated by 
the main verb or in relation to utterance time 
used for subordinate clauses also, ‘to want’ or ‘to 
like to' (in the future) in some imperative 
constructions 


hua- Perfect For past actions that have occurred more than 
once, also used in the negative to show a time 
during which something has not happened 


na- Stative Forms a stative verb, more limited distribution 
occurs with about half of the roots 


For the purposes of this study, the TAM prefix will be referred to as an as- 
pectual prefix, but no claim is being made as to the specific syntactic-semantic 
function of these prefixes and a complete analysis of the ZAI TAM system is 
outside the scope of this project. 


?Pickett et al. (1998) describes the ZAI TAM system as essentially an aspectual system with only 
one tense prefix (future). Mock (1990) describes the system as just aspect and mood, while 
Suárez (1983) describes the system as one that combines tense, aspect and mood. A complete 
study of the ZAI TAM system would be extremely valuable (see Pérez Báez (2015); also Sicoli 
(2015) for the TAM system of Lachixío Zapotec . 
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Finally, it should also be noted that there is no morphological case marking 
on nouns and there is no agreement between the verb and any of its arguments. 
Some features of ZAIthat are canonical of most verb-initial languages are: adjec- 
tives generally follow nouns, possessive constructions are possessor final, and 
the use of prepositions rather than postpositions. I address constituent order cor- 
relations further in the next section, where I analyze the position of the verb with 
respect to the direct object. 


2.3.2 Constituent order correlations 


Previous research on ZAI has claimed that the most common arrangement of 
constituents is verb followed by the subject then any objects (Pickett 1960; Pick- 
ett et al. 1998).!° Verb-initial languages are much less common than verb-final 
languages (Payne 1995). However, it is also generally understood that “no lan- 
guages are rigidly verb-initial in the same sense that some languages are rigidly 
verb-final” (E. Keenan, quoted in Payne (1995: 455)). These two facts make the 
study of constituent order and of verb-initial languages challenging as there are 
well-known problems with establishing the relevant criteria to determine the 
basic constituent order in a language. Salient among these are two particular dif- 
ficulties: 1) the order of subject and verb and the order of object and verb are 
often easier to identify while the order of subject and object is often more diffi- 
cult to identify; and 2) pronouns may exhibit constituent order properties that 
differ considerably from lexical noun phrases. 

In determining these patterns for a language, should the relevant criterion be 
one of frequency, of distribution, or of pragmatics? In constituent order typology, 
frequency has been the primary criterion used (Dryer 2007). It can be argued that 
differences in frequency often provide a more reliable test than other tests (where 
the difference is large enough). However, differences in frequency might be an 
artifact of a particular set of texts, due to genre specific or speaker idiosyncracies, 
for example, and one might therefore find very different frequencies in a different 
set of texts. Also, frequency counts of some languages may not reveal one order 
as noticeably more frequent than the other. Additionally, it can also be argued 
that because it is not part of the grammar of the language, frequency should not 
be used widely as a criterion (Dryer 2007). 


The same is true for most if not all Zapotec languages (see e.g. Lee 2000 for San Lucas Quiavini 
Zapotec (Central); Beam de Azcona (2004) for Coatlán-Loxicha Zapotec (Southern); Sonnen- 
schein (2005) for San Bartolomé Zoogocho Zapotec (Northern); Sicoli (2007) for Lachixío Za- 
potec (Eastern)). 
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A criterion of distribution refers to whether the fact that one order, found to be 
in some way less restricted in its distribution, can be used as an argument that it 
is more basic than another, more restricted order. In a similar fashion, one order 
in a language may be considered pragmatically neutral and another to have some 
added pragmatic effect. However, it may not be obvious that one order adds any 
additional elements and, instead, the two orders may simply have a difference in 
meaning (e.g. OV order may be associated with indefinite objects and VO order 
with definite ones). 

In this section, I analyze the correlates of various grammatical elements with 
the relative order of verb and object in order to determine a tendency in ZAI 
toward either verb-object (VO) order or object-verb (OV) order. As will be seen, 
all but two of the elements correlate with a VO order, as would be expected. 
The section that follows will discuss the subject position and will show that the 
exceptions to the V(S)O order are the ones that are pragmatically marked. 

The universal tendencies associated with OV versus VO order are found in 
languages in which there is considerable flexibility of constituent order, even 
among languages in which one order outnumbers the other by a frequency of 
only 2 to 1 (Dryer 2007). These elements are listed in Table 2.7. 

Examples for each are provided in the following discussion. 


2.3.2.1 Use of prepositions 


ZAl uses prepositional phrases, as in the following two examples: 


(4) má bietebe délu  yaga qué 
ma’  bi-yete-belH delu yaga que! 
already COMPL-descend=3.HUM PP face tree DIST 


‘He already came down from on the tree? 


(5) cuchabe cáni ndáaniti lari 
c.u-cha-belH caznil 
PROG.CAUS-fill=3. HUM PL=3.INAN PP one cloth 


ndaaniti lari 


‘He (was) putting them in a shirt: 


Prepositions in ZAL, if they are not borrowed from Spanish, are body part 
terms." In (4), the body part term lu ‘face’ is used as part of the prepositional 
phrase de lu yaga que “from on the tree” (lit. from face tree that”). In this case, 


"For more on the use of body-part terms in Zapotec languages, see e.g. MacLaury (1989) and 
Pérez Báez (2011). 
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Table 2.7: Elements whose order correlates strongly with that of verb 


and object (Dryer 2007) 


OV 


VO 


postpositions 

adpositional phrase - verb 
genitive - noun 

manner adverb - verb 
standard - marker 
standard - adjective 

final adverbial subordinator 
main verb - auxiliary verb 
predicate - copula 

final question particle 

final complementizer 

noun - article 

noun - plural marker 
subordinate clause - main clause 


relative clause - noun 


prepositions 

verb - adpositional phrase 
noun - genitive 

verb - manner adverb 
marker - standard 

adjective — standard 

initial adverbial subordinator 
auxiliary verb — main verb 
copula — predicate 

initial question particle 
initial complementizer 
article - noun 

plural marker - noun 

main clause — subordinate clause 


noun - relative clause 


the prepositional phrase is headed by the preposition de borrowed from Spanish. 
In (5), the body part term ndaani 'stomach' functions as the prepositional head 
of the phrase ndaani ti lari ‘in a shirt’ (lit. “stomach one shirt”). 


2.3.2.2 Adpositional phrase placed after the verb 


The examples in (4) and (5) demonstrate that the position of adpositional phrases 
is after the verb, as expected for a language whose basic order is VO. 
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2.3.2.3 Genitive follows the possessed noun 


As would be expected in a language with VO order, lexical genitives follow pos- 
sessed nouns in ZAI, as in (6): 


(6) cayaadxa ti dxumi  péra  badunguiiu 
ca-yaadxa' ti dxumil! pera badunguiiu 
PROG-be.missing one basket pear man 


“One of the man's baskets of pears was missing. 


In the complex subject NP, ti dxumi pera badunguiiu, the lexical genitive badun- 
guiiu ‘man’ appears after the possessed noun ti dxumi pera ‘a basket of pears. 
In addition, possessive pronoun clitics also follow possessed nouns, as in (7): 


(7) bidxí'babe lú  xpiciclétabé 
bi-dxi'Hba-belH lu x-bicicleta=be!™ 
COMPL-climb.up-3.HuM face poss=bicycle=3.HUM 


“He got on his bicycle: 
Here, the third-person subject clitic -be appears as an enclitic on the possessed 
noun bicicleta ‘bicycle’, to which the possessive prefix x- attaches. 
2.3.2.4 Manner adverbs follow the verb 


Manner adverbs may follow the verb, as in (8), where the adverb nachaahui' ap- 
pears after the verb: 


(8) biluxebe nachaahui’ 
bi-luxe-belH na-chaahui’ 
COMPL-finish=3.HUM STAT-well 
“S/he finished well. 


They may also attach directly to the end of the verb root, as modifiers, as in (9): 


(9) gátachaahui ira guétabaadxi cá 
gH-a'ta-chaahui' guiralH guetabaadxi cal! 
IMP-lay-well all tamal DEM 


'Lay down all the tamales carefully: 


In example (9), the verb root a’ta ‘lay down’ contains a glottalized vowel that is 
pronounced when stressed. In this case, the adverb chaahui’ appears immediately 
after the verb root and stress falls not on the verb root but on the adverb, as it is 
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the rightmost element of the verbal constituent. Stress lenites in all elements to 
the left, as we saw in 82.2.3. 

There are, however, cases in which an adverb may appear before the verb, as 
in (10): 


(10) nachaahui bíluxebé 
na-chaahui’ bi-luxe-belH 
STAT-well compt-finish=3.HUM 


‘S/he finished WELL: 


Cases such as this occur when information carried by the verb is presupposed and 
the manner adverb is asserted, or focused (cf. 8). These cases are pragmatically- 
marked in the sense of Payne (1995), as I will explore in 82.3.5. 

Variation in the relative position of main clause and adverbial clause is com- 
mon in ZAI, as in many languages. Conditional clauses, for example, exhibit a 
universal tendency to precede the main clause (Haiman 1978). In this study, I 
consider this variation to be related to discourse pragmatics and to the commu- 
nication of topical information. This will be explored in more detail in Chapter 6, 
where the issue of subordinate adverbial clauses will be tied closely to the anal- 
ysis of the LA particle, which is the topic of $6.2. 


2.3.2.5 Order in comparative constructions is adjective-marker-standard 


The comparative construction currently used in ZAL with the order adjective- 
marker-standard, is a construction borrowed from the Spanish más que. An ex- 
ample is shown in (11): 
(11) jmá nahuinni jfaabe qué bixhozebé 

jma! na-huinni jüaa-belH que bixhoze-belH 

more STAT-small mother=3.HUM than father-3. HUM 

‘His/her mother is younger than his/her father. 
The order here is adjective-marker-standard. The native ZAI comparative con- 
struction has not yet been documented. However, in San Lucas Quiaviní Zapotec, 


a central Zapotec language, the native comparative construction appears to also 
have an adjective-marker-standard order (Galant 2006), as in (12): 


(12) Zyuüa'-ru'Lia Oli'ebloh Rrodriiegw. 
tall-ER Ms. Olivia than Rodrigo 
“Olivia is taller than Rodrigo’ 


It is likely that the native ZAI comparative construction would be similar. 
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2.3.2.6 Initial adverbial subordinator 


ZAI has a long list of adverbial subordinators, all of which have been borrowed 
from Spanish: ora, lugar de, ante, dede, cada, para, cumu, modo, sinuque, sin. All 
adverbial subordinators occur at the beginning of the subordinate clause. Some 
examples are: 

(13) órá ca lá, má áca licuárni 


ol H H LH LH 


ra cal! laH ma!  glHaca licua LH 


r=ni 
when DEM LA already ImP-become blend-3sG.INAN 
“At that time, blend it? 


(14) ante delas ócho  chuudü 
alH nte de las ocho ch-uu=dulH 


before of the eight  POT-g0=1PL.EXCL 


“Before eight we go. 


(15) púrti ma las ócho dela mañána  chuuzulu 
pulHrti ma las o-Hcho de la mañana chuu-zulu=@ 
because already the eight of the morning  POT.go-begin-3sG.INAN 


“Because already at eight in the morning it was going to begin? 


As with the comparative construction, it is likewise unclear what the native 
clause-combining strategy is; perhaps one of juxtaposition, but this is conjecture 
and requires further study. 


2.3.2.7 Auxiliary verb precedes main verb 


A minority of verbs can occur as auxiliary verbs. When they do, they appear 
before the main verb. One example is -anda “be able to’ in (16), followed by the 
main verb: 


(16) ¿zanda iganitu lá? 
guilH-ganilH- qjLH 14H 
FUT-be.able POT-be.silent=2PL Q 


z-anda 


“Can you (all) be quiet? 
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2.3.2.8 Copula precedes the predicate 


There is no copular construction in ZAI. However, nonverbal predicates occur at 
the beginning of the clause, as in the following example: 


(17) mecánico  laabé 
meca nico laa-belH 


mechanic BASE=3.HUM 


“He is a mechanic.’ 


2.3.2.9 Question particles 


Interrogative expressions in content questions in verb-initial languages most 
commonly occur at the beginning of sentences. This is true in ZAI as well. In 
the examples below, the question words panda ‘how many’ in (18) and pabia’ 
‘how much’ in (19) occur sentence-initially: 


(18) ¿panda  kilómetru  bixooñelu raqué? 
panda! kilometru bi-xooñe'=lu” | raquelH 
how.many kilometer COMPL-run=2sG then 


“How many kilometers did you run?” 


(19) ;pabiá ^ ruxoofelu ira dxí ya? 
pabia#  ru-xooñe-lu” guiral! dxi ya 
how.much HAB-run=2sc all day Q 


'How much do you run every day?' 


Yes/no question particles in verb-initial languages most often also occur at the 
beginning of the sentence. In ZAL however, such a particle is not obligatory and, 
in fact, is rarely used. The final particle LA is required in yes/no questions: 


(20) ;(üée)biiyalu laabe lá? 
ñee! bi-uuya-lu! ^ laa-belH la! 
Q COMPL-see=2SG BASE=3.HUM LA 
‘Did you see him/her?’ 


The question ¿ñée biiyalu laabe?, without the La particle, would be ungrammati- 
cal. 12 


One of the hypotheses examined in more detail in Chapter 6 is that the yes/no question particle 
LA is related to the La particle involved in the marking of topical information. 
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2.3.2.10 Initial complementizer 
There is no overt complementizer in ZAI. An example is shown in (21): 
(21) binadiaagá binda ti gaayu 

bi-nadiaaga-a'H bi-nda ti gaayu 

COMPL-hear-1sc COMPL-sing one rooster 


T heard a rooster sing: 


2.3.2.11 Article appears before the noun 


It is common for the article to precede the noun in VO languages.P There are 
no articles in ZAI. However, quantifiers such as ti 'one' (22) and ca PL (23) may 
precede the noun: 


(22 ti badunguiiu 
ti badunguiiu 
one man 


€ y 
one/a man 


(23) ca badunguiiu 
ca badunguiiu 
PL man 


€ > 


men 


Both of these NPs are indefinite. To mark definiteness, ZAI employs demonstra- 
tives, which must appear after the noun: 


(24) ti badunguiiu qué 


ti badunguiiu que! 
one man DIST 
‘that man’ 


(25) ca badunguiiu quě 
ca badunguiiu que 
PL man DIST 


LH 


‘those men’ 


BAn additional, though weaker, correlation is that articles appear to be somewhat more common 
in VO languages than they are in OV languages. 
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Unlike articles, the position of demonstratives does not exhibit a cross-linguistic 
correlation with respect to the order of object and verb. The use of demonstratives 
in discourse will be explored in more detail in Chapter 3. 


2.3.2.12 Plural marker - noun 


The plural marker ca always precedes the noun in ZAI, as was shown in (23). 


2.3.2.13 Main clause - subordinate clause 


Many languages, including ZAI, exhibit considerable freedom in the position of 
subordinate clauses. In some cases, adverbial subordinate clauses in ZAI can pre- 
cede the main clause, as was seen in (13)-(15). However, complement clauses fol- 
low the main clause, as shown here (cf. (26)-(27)): 


> 


(26) racaladxi Juán guéedá Míguél íxí 
ri-aca-ladxi Juan"! guH=eedal"! Miguel" guixi’ 
HAB=occur-gut Juan Por-come Miguel tomorrow 


H 


“Juan wants Miguel to come tomorrow: 


(27 na Juán biiya Miguél  caxcuidi qué 
na Juan” bi-uuya Miguel! ca xcuiPdi que 
say Juan Miguel cowPr-see PL child pist 


LH 


“Juan said Miguel saw the children. 


2.3.2.14 Noun - relative clause 


Almost all VO languages place the relative clause after the noun, as illustrated in 
(28). Here, the relative clause ni riree ndaani yuze ‘that comes out of the stomach 
of the cow’ follows the NP cuaju ca ‘the rennet’: 


(28) cuáju ca ni riree ndaani yúzé 
cuaHju calH ni ri-ree ndaani yulHzelH 


rennet DEM REL HAB-leave stomach cow 


“The rennet that comes out of the stomach of the cow? 


2.3.3 Summary of constituent order correlations 


The above discussion has shown that the great majority of the constituent order 
correlations in Table 2.7 conforms to a pattern of verb-object in ZAI. A summary 
of these correlations in ZAI and how they are manifested is presented in Table 2.8. 
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Table 2.8: Correlations between verb and object order in ZAI 


VO order correlations ZAI 

prepositions Y 

verb - adpositional phrase Y 

noun - genitive Y 

verb - manner adverb Variable, obeys discourse motivations 
marker - standard "4 (native construction unknown) 
adjective - standard "4 (“native construction unknown) 
initial adverbial subordinator Variable, obeys discourse motivations 
auxiliary verb - main verb "4 

copula - predicate No copula 

initial question particle Yes/no particle appears clause-finally 
initial complementizer Y 

article - noun No articles 

plural marker - noun Vv 


main clause - subordinate clause Variable, obeys discourse motivations 


noun - relative clause Y 


While the majority of the constituent order correlations discussed conform to 
cross-linguistic tendencies for VO languages, it is worth noting the exceptions 
here. First, there are no copula or articles in ZAI. Second, the principal rigid ex- 
ception is the yes/no question particle LA, which appears utterance-finally rather 
than, as would be expected for an VO language, utterance-initially. This particle 
will be analyzed in more detail in $6.2. Finally, several constituent order corre- 
lations show variation. We saw that in the cases of the manner adverb - verb or 
main clause - subordinate clause, the order obeys specific discourse motivations. 


34 


2.3 Clause structure and constituent order correlations in ZAI 


These motivations will be explored more fully in Chapters 5 and 6. The next sec- 
tion follows up this discussion of constituent order by focusing more specifically 
on the pre-verbal position, which we know to be a prominent position cross- 
linguistically and, in particular, in verb-initial languages. 


2.3.4 The pre-verbal position and rigidity in verb-initial syntax 


In her analysis of the pragmatic properties of verb-initial languages, Payne (1995) 
surveys the discourse functions that constituents may have in pre-verbal position. 
She groups these functions under the label “pragmatically marked”, that is, “in- 
formation which is to some degree counter to what the speaker assumes are the 
hearer's current expectations or presuppositions” (Payne 1995: 110). Payne argues 
that there exists a continuum for pragmatically marked (PM) information which 
includes, on one end, information that is contrary to hearer's assumptions and, 
on the other, information in accord with or only incrementally different from the 
hearer's expectations. Based on this observation, Payne proposes a hierarchy of 
pragmatic markedness, represented in Table 2.9: 


Table 2.9: A hierarchy of pragmatic markedness (Payne 1995: 479) 


more marked > less marked 


NP in descriptive or > NP establishing >  Pragmatically 
background clause a foundation marked NPs 


According to this hierarchy, if a verb-initial language places phrases before 
the verb to accomplish any function to the left on the following hierarchy, all 
phrases that accomplish functions to the right on the hierarchy will also occur 
before the verb. That is, among PM phrases, if a verb-initial language places a 
somewhat more marked phrase type before the verb, then it will also place less 
marked types before the verb. Languages that fall to the left on this hierarchy are 
clearly less rigidly verb-initial than are languages to the right. 

As will become clear from the following discussion, however, ZAl is not a 
rigidly verb-initial language. Indeed, all of the elements in the hierarchy - from 
descriptive and background clauses to pragmatically-marked NPs - can appear 
in pre-verbal position. I discuss the pre-verbal position in more detail in the next 
section as this is an important fact, and one that I will return to throughout 
the analysis in the remainder of this study. It will become especially relevant in 


35 


2 Background: the basic grammatical structures of ZAI 


Chapter 5 and Chapter 6, when I discuss the question of the relative "rigidity" of 
ZAI syntax and its relation to the types of topic and focus constructions available 
to ZAI speakers. 


2.3.5 The pre-verbal position in ZAI 


In rigid verb-initial languages, predicates also come first in clauses that are not 
temporally sequenced, but which serve to introduce and describe referents, state 
background conditions, or describe events that are out of sequence with the 
main event line (Payne 1995: 454). An almost universal strategy in verb-initial 
languages, however, is that if part of a sentence is questioned or is the answer to 
a question, it will come first. They are, in the words of Payne (1995), "pragmati- 
cally marked,” in the sense that initial position is associated with novel attention 
re-direction of some kind. The remaining constituents come at the end. 

The pre-verbal position has been identified as a privileged position from the 
perspective of information structure in other Zapotec languages as well. For ex- 
ample, Broadwell (2002) for San Dionicio Ocotepec Zapotec (Central Zapotec) 
and Lee (2000) for San Lucas Quiavini Zapotec (Central Zapotec) also identify 
the pre-verbal position as a topic or focus position. Similarly, Black (2000: 103), 
in her study of Quiegolani Zapotec (Central Zapotec) syntax, states, "Discourse 
analyses done on other Zapotecan languages show that the fronted nominal may 
be either old or new information” 

In addition to much of the data already explored above involving constituents 
in pre-verbal position (cf. adverbial clauses (13)-(15)); also, adjectives, as in (10)), 
the patterns described below provide further evidence that the pre-verbal po- 
sition in ZAI is indeed the locus for a variety of discourse functions, such as: 
question words, negation, focus of contrast (e.g. subject or objects NPs, adjec- 
tives), and initiation of new subsections of a text through the (re)introduction of 
participants. 


2.3.5.1 Pre-verbal position: wH-words 


As seen above in (18) and (19), the pre-verbal position is reserved for wH-words. 
Two additional examples are provided here in (29) and (30): 


(29) ¿xi bíf'nibé? 
xil" b-i'ni-be-H 
what COMPL-do-3sG 
“What did s/he do?” 
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(30) ¿tu bni ni? 
tul! b-i'ni nilH 
who COMPL-do 3.INAN 
“Who did it?” 


2.3.5.2 Pre-verbal position: negation 
Negation in ZAl always precedes the verb, as in (31): 
(31) qué reedabé guírá  dxí 
que! r-eedalH-pe" guira’ dxi 
NEG HAB-come-3sc all day 
'S/He doesn't come every day: (Pickett et al. 1998: 78) 


2.3.5.3 Pre-verbal position: focus of contrast 


Pickett et al. (1998) note that a core argument can be "emphasized" by placing it 
before the verb. In such constructions, if the argument is a full noun phrase, no 
co-referring subject clitic pronoun is found on the verb, as shown in (32): 


(32) Pédro  biiya ti badudxaapa 
PelHdrobi-uuya ti badu-dxaapa 
Pedro  COMPL-see INDEF child-woman 


“PEDRO saw a girl? (Pickett et al. 1998: 98) 


If the argument is a pronominal subject, however, a co-referring dependent pro- 
noun does appear cliticized to the verb, as shown here in (33): 


(33) laabe bí'yabe tí badudxaapa 
laa-belH b-iya-belH — ti badu-dxaapa 
BASE-3SG COMPL-see-3sG INDEF child-woman 
'S/HE saw a girl. (Pickett et al. 1998: 98) 


Additionally, a construction which places the object in pre-verbal position is also 
possible in ZAI. For example, in answer to the question “What did s/he do?” (29), 
one can respond: 


(34) dxiifia bi'nibé 
dxiiiia' bi-ini=be 
work  coMPr-do-3sc 
“S/He did WORK? 
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It is possible, also, to use a similar construction involving the discourse particle, 
NGA. 


(35) dxiiña nga bi'nibé 
dxiifia’ nga bi-ini=be H 
work NGA COMPL-do-3sc 
“S/He did WORK? 


In this case, the relevant interpretation is that of an exhaustive listing. A more 
detailed discussion of this particle will be taken up in §5.1.4. 

Although it is not clear what Pickett, et al. refer to by “emphasized”, it is clear 
that the use of an NP in pre-verbal position in each of these cases communicates 
discourse-pragmatic information. In Chapters 5 and 6, I analyze these construc- 
tions as "identificational" or "argument focus" constructions, where only a single 
NP is focused and the rest of the proposition is within the presupposition (Lam- 
brecht 1994: 228-233). As will be shown, in these cases, the NP in pre-verbal 
position is not necessarily “new” information, as it is not the focused noun it- 
self which contributes the new information to the discourse, but the relationship 
between (the referent of) this noun and the entire proposition. 


2.3.5.4 Pre-verbal position: left-dislocated phrases 


Finally, as will be discussed in more depth in Chapters 3 and 6, some nouns (in- 
cluding independent pronouns) appear in the pre-verbal position and are sepa- 
rated by the particle LA as well as by a pause in the intonation. These are left- 
dislocated phrases, i.e. phrases that occur under a separate intonation contour, 
and which may or may not be morphosyntactically related to the verbal case 
frame. If related, a resumptive reference may occur. These left-dislocated phrases 
often delimit a time, location, or some other conceptual frame of reference for 
what follows. By contrast, a non-dislocated pre-verbal phrase may or may not 
be related to the verbal case frame, but, if it is, a resumptive reference will likely 
not occur. 


2.4 Summary and research questions 


In summary, this chapter has described the main phonological and syntactic char- 
acteristics at the core of ZAI grammar. It was shown that ZAl is a tonal language, 
with high, rising and low contrastive tones and that these interact in complex 
ways with vowel phonation and a fortis-lenis distinction in consonants. It was 


38 


2.4 Summary and research questions 


also shown that stress and tone play a significant role in prosody beyond the 
word-level. Verb morphology is primarily agglutinative, in that there is no mor- 
phological case marking on nouns and there is no agreement between the verb 
and any of its arguments. I then reviewed the main patterns in constituent order 
relations in ZAI and showed that the most common arrangement of constituents 
in ZAI is considered to be verb followed by subject, then object. Finally, many fea- 
tures of ZAI are characteristic of verb-initial languages: adverbial subordinators 
are clause-initial; presence of prepositions rather than postpositions; adjectives 
generally follow nouns; possessive constructions are possessor final, etc. How- 
ever, verb-initial syntax is often violated as the pre-verbal position can be the 
locus for important discourse functions. 

With this background in mind, I devote the chapters that follow to an examina- 
tion of the interplay between verb-initial order, tone and prosody in ZAI. As has 
been pointed out, little has been said about the possible phonological, morpholog- 
ical and/or syntactic correlations with the expression of information structure in 
this language. From the preceding discussion, however, several questions arise 
that will guide the analysis with respect to four areas: 1) the relation between 
nominal forms and cognitive status; 2) constituent order; 3) discourse particles; 
and 4) prosody. I list these questions here: 


Nominal forms and cognitive status 


* How do the different morphological forms of nominals express different 
cognitive statuses? How does each cognitive status correlate formally with 
type of nominal expression? 


* To what extent do phonetic and intonational cues play a role in the expres- 
sion of cognitive status? 


Constituent order 


e Verb-initial syntax in ZAI is frequently violated in constructions in which 
topicalized and focalized elements may often appear before the verb. Since 
constituent order is known to have important discourse functions in many 
languages and since a small percentage of the world's languages are verb- 
initial, how does verb-initial syntax in ZAI condition the ways that speak- 
ers mark topic and focus? 


* Are constituent order changes a possible strategy for expressing all types 
of topic and focus constructions or only a subset? How pragmatically and 
syntactically "rigid" is the language? 
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Discourse particles 


e There are two discourse particles, La and NGA, that are involved in ex- 
pressing information structure in ZAI. Can the LA form be considered a 
contrastive topic marker? Is the NGA form involved in the realization of 
focused material? 


* In which cases is the use of these particles infelicitous? 
Prosody 


e If the realization of contour tones is tied to the realization of stress and of 
pauses, what is the distribution of stress and of pauses at the phrase- or 
discourse-level? Are they predictable? 


* Are stress and pauses involved in the realization of topic and focus struc- 
tures? Do topic and focus structures have a constant prosodic realization? 
In other words, is prosody involved in the realization of topic or focus? 


In the next chapter, I take the grammatical information presented here as a 
basis to address the first group of questions listed above with respect to ZAI 
nominal and pronominal forms, as well as their potential functions in discourse. 
In particular, I explore the ways in which different forms may signal different 
types of cognitive status, terms which will be illustrated below. 


40 


3 Preferred Argument Structure and the 


pragmatic status of nominal forms in 
Z Al 


In the study of information structure, it is necessary to make a distinction be- 
tween: a) the pragmatic states of the referents ofindividual sentence constituents 
in the minds of the speech participants, and b) the pragmatic relations established 
between these referents and propositions. The focus of this chapter is on the first 
of these. I will turn to the issue of topic and focus relations in Chapters 5 and 6. 


3.1 Preferred Argument Structure in ZAI 


This section is concerned with the relationship between the realization of nomi- 
nal forms and the syntactic role in which they appear. I will frame the analysis 
using Du Bois's theory of Preferred Argument Structure (Du Bois 1987; Du Bois 
et al. 2003; Du Bois 2003a,b), with two main goals in mind: 1) to observe the types, 
frequencies, and syntactic distributions of the nominal forms used by ZAI speak- 
ers to satisfy their discursive goals, and 2) to evaluate the capacity of Preferred 
Argument Structure to account for the patterns observed. 


3.1.1 Data and Methodology 


The data for this section are made up of narratives elicited from seven ZAI- 
Spanish bilingual adults between the ages of 25 and 45. To ensure comparability 
across this and Du Bois and others’ work, I asked the consultants to view the 
Pear film, a short 7-minute film designed for cross-linguistic comparison (Chafe 
1980), and then to afterward retell the plot of the story.! 


The four main characters in the Pear film are (the abbreviations follow Chafe 1980): Bike boy, 
Bike girl, Pear man, and the Three boys. The outline of the Pear Story is reproduced here from 
Chafe (1980: xiii-xiv) for convenience: 

The film begins with a man picking pears on a ladder in a tree. He descends the ladder, kneels, 
and dumps the pears from the pocket of an apron he is wearing into one of three baskets below 
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I administered the seven interviews and recorded the narratives in Juchitán. At 
the time of the interviews I had enough knowledge of the language to carry on 
basic conversations. The speakers I interviewed were all citizens of Juchitán who 
I saw and spoke to in Isthmus Zapotec on a daily basis and who made regular 
attempts to help me listen to and understand normal everyday speech. There- 
fore, although the situation was somewhat unnatural given my lack of native 
fluency in the language, I do not think this necessarily compromised the natu- 
ralness of the recorded narratives. I later transcribed the narratives myself and 
corroborated my transcriptions with a native ZAI speaker (not one of the seven 
participants). 

As mentioned in Chapter 2, I use the “intonation unit" (Chafe 1994) as the basis 
for the transcription as well as for the analysis below. I understand intonation 


the tree. He removes a bandana from around his neck and wipes off one of the pears. Then he 
returns to the ladder and climbs back into the tree. Toward the end of this sequence we hear 
the sound of a goat, and when the picker is back in the tree a man approaches with a goat on 
a leash. As they pass by the baskets of pears, the goat strains toward them, but is pulled past 
by the man and the two of them disappear in the distance. 

We see another close-up of the picker at this work, and then we see a boy approaching on 
a bicycle. He coasts in toward the baskets, stops, gets off his bike, looks up at the picker, puts 
down his bike, walks toward the baskets, again looking at the picker, picks up a pear, puts it 
back down, looks once more at the picker, and lifts up a basket full of pears. He puts the basket 
down near his bike, lifts up the bike and straddles it, picks up the basket and places it on the 
rack in front of his handle bars, and rides off. We again see the man continuing to pick pears. 

The boy is now riding down the road, and we see a pear fall from the basket on his bike. 
Then we see a girl on a bicycle approaching from the other direction. As they pass, the boy 
turns to look at the girl, his hat flies off, and the front wheel of his bike hits a rock. The bike falls 
over, the basket falls off, and the pears spill out onto the ground. The boy extricates himself 
from under the bike, and brushes off his leg. 

In the meantime we hear what turns out to be the sound of a paddleball, and then we 
see three boys standing there, looking at the bike boy on the ground. The three pick up the 
scattered pears and put them back in the basket. The bike boy sets his bike upright, and two 
of the other boys lift the basket of pears back onto it. The bike boy begins walking his bike in 
the direction he was going, while the three other boys begin walking off in the other direction. 
As they walk by the bike boy's hat on the road, the boy with the paddleball sees it. picks it 
up, turns around, and we hear a loud whistle as he signals to the bike boy. The bike boy stops, 
takes three pears out of the basket, and holds them out as the other boy approaches with the 
hat. They exchange the pears and the hat, and bike boy keeps going while the boy with the 
paddleball runs back to his two companions, to each of whom he hands a pear. They continue 
on, eating their pears. 

The scene now changes back to the tree, where we see the picker again descending the 
ladder. He looks at the two baskets, where earlier there were three, points at them, backs up 
against the ladder, shakes his head, and tips up his hat. The Three boys are now seen approach- 
ing, eating their pears. The picker watches them pass by, and they walk off into the distance. 
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unit to mean the stretch of speech occurring between two specific prosodic cues: 
an initial pause and a final lengthening. The reason for this is that intonation 
units have been shown to operate as fundamental units of cognitive processing, 
social interaction, and other domains, or in Chafe's words, as as representing a 
single "focus of consciousness" (see also Du Bois et al. 1993). Since intonation 
units tend to correspond very closely with simple clause structure, we will see in 
the vast majority of the examples below that the intonation unit tends to overlap 
with a core clause (i.e. a predicate plus its nominal arguments) in such a way that 
the arguments of a clause core fit within the single intonation contour delimited 
by the intonation unit.’ 

This study is based on a total of 346 clauses. The Pear Story was chosen as 
the method of elicitation because of its conduciveness to cross-linguistic com- 
parison. With the exclusion of first and second person arguments, the analysis 
concentrates on the variety and distribution of third person forms and involves 
a quantitative study of the nominal forms, as this allows verification of the recur- 
rent role and quantity tendencies predicted by Preferred Argument Structure and 
observed in the ZAI narratives. Given that there are no other existing linguistic 
studies of ZAI discourse, and despite a significant amount of poetry and litera- 
ture published in the language, the claims here are still preliminary and leave 
open the question of possible sociolinguistic variation in terms of variables such 
as genre or dialect. 


3.1.2 Evidence for PAS in ZAI 


In his theory of Preferred Argument Structure (PAS), Du Bois (1987; 2003a,b) 
makes specific correlations between discourse patterns and the form of the “core” 
arguments of the verb. Based on data from narratives in Sakapultek Maya, an 
ergative language spoken in Guatemala, Du Bois (1987) proposed the set of four 
closely related grammatical and pragmatic constraints at work in the distribution 
of arguments in spoken discourse shown in Table 3.1. 

Along the pragmatic dimension, the One New Argument Constraint reflects 
the tendency for no more than one core argument in a clause to contain new 
information. Another constraint, the Given A Constraint, states that this new 
information (typically expressed by full lexical noun phrases) freely appears in 
the intransitive subject position (the S role) or the transitive object position (the 


"There is, however, an important exception to this tendency in the ZAI data presented here. This 
is the case of “marked topics” or topicalized NPs set off in a separate preceding intonation unit 
without a verb, which are analyzed in $3.1.7.2. 
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Table 3.1: Preferred argument structure constraints (Du Bois 2003a: 34) 


Grammar Pragmatics 


Quantity Avoid more than one lexical ^ Avoid more than one new core 


core argument argument 
"One Lexical Argument "One New Argument 
Constraint” Constraint” 
Role Avoid lexical A Avoid new A 
“Nonlexical A Constraint” “Given A Constraint” 


O role), but not in the transitive subject position (the A role).? Parallel to this, 
along the grammatical dimension, the One Lexical Argument Constraint refers 
to the scarcity of clauses in which more than one core argument is expressed 
with a full noun phrase, the additional core arguments being expressed with pro- 
nouns or zero forms. Finally, the Non-lexical A Constraint reflects the tendency 
for speakers to freely realize full lexical noun phrases in the intransitive subject 
position (the S role) or the transitive object position (the O role), but strongly 
avoid placing them in the transitive subject position (the A role). 

Thus, the constraints on role refer to the avoidance of lexical/new transitive 
subjects and the constraints on quantity refer to the avoidance of more than one 
lexical/new argument in the same clause. The existence of these constraints has 
been supported by much empirical cross-linguistic research and this has been 
accepted by many as evidence that PAS is a universal feature of discourse. 

The strong tendency for new and lexical arguments to appear in S and O roles 
and to avoid the A role, though not a categorical rule, has been shown to occur 
widely in the spontaneous discourse of many typologically diverse languages 
(e.g. Hebrew, Sakapultek, Papago, English, Spanish, French, Brazilian Portuguese, 
Japanese, Achenese, Nepali, Finnish and Mapudungun) and in many genres and 
contexts (e.g. spoken, written, child interaction) (see Du Bois et al. 2003 and con- 
tents therein). That said, there are a number of studies that question the validity 
of PAS and its universality (see e.g. Haig & Schnell 2016 and Brickell & Schnell 
2017 for recent, well-structured, and insightful critiques.) 

As will be seen in the following discussion, the tendencies predicted by PAS 
do occur widely in third-person narratives in ZAI. Table 3.2 summarizes the dis- 
tribution across the core clause of full lexical noun phrases (LNP). 


>The term “core argument” is used in the sense of Dixon (1979), where A refers to the transitive 
subject, O to the transitive object, and S to the intransitive subject. 
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Table 3.2: Lexical arguments in core grammatical roles 


A role S role O role Total 


LNP 9% (19/201) 26%(52/201) 65% (130/201) 100% (201/201) 


Out of 201 total LNPs in the corpus, only 19 occur in the A role. The pattern 
of distribution of LNPs obeys the Non-lexical A constraint, as predicted by PAS. 
The majority of LNPs occur in the O role (65%), followed by the S role (26%) and 
finally the A role (9%). The rate of lexical mentions in the S role thus falls in 
between the rate of lexical mentions in the O and A roles. Du Bois (2003b: 37) 
reports similar patterns found in several other unrelated languages, as seen in 
Table 3.3:* 


Table 3.3: A cross-linguistic comparison of lexical arguments in core 
grammatical roles 


Language A role S role O role Total 

Hebrew 8% (18/232)  44%(103/232) 48% (111/232) 100% (232/232) 
Sakapultek 5% (11/218) 58% (126/218) 37% (81/218) 100% (218/218) 
Papago 10% (37/358) 47% (169/358) 42% (152/358) 100% (358/358) 
English 8% (21/257) 35% (90/257) 57% (146/257) 100% (257/257) 
Spanish 6% (35/591) 36% (215/591) 58% (341/591) 100% (591/591) 
French 5% (32/646) 45% (290/646) 50% (324/646) 100% (646/646) 
BrPortuguese 8% 39% 53% 100% 

Japanese 7% (48/661) 48% (320/661) 44% (293/661) 100% (661/661) 


One possible explanation for the scarcity of lexical As could be the scarcity of 
A positions that appear in the corpus. This does not appear to be the case, how- 
ever. Of the 346 total clauses attested, 149 (or 43%) are transitive (or ditransitive) 
clauses, a fairly common proportion in oral speech.” Table 3.4 shows that when 
we take the number of lexical As as a proportion of total As, the percentage is 
still very low. 


“The data for Sakapultek, Brazilian Portuguese, English and part of the Hebrew data are from 
narratives elicited from viewers of the Pear Story (Du Bois 2003a: 62-63). Du Bois does not 
report the exact number of tokens for Brazilian Portuguese. 

5“Generally one-third to one-half of clauses are transitive versus two-thirds to one-half in- 
transtive” (Du Bois 2003a: 63-64). 


45 


Preferred Argument Structure and the pragmatic status of nominal forms 


Table 3.4: Proportion of lexical arguments per argument position in 
ZAI 


A S O 
percent lexical arguments 13% (19/149) 32% (52/165) 77% (130/168) 


When viewed this way, the percentages also increase slightly for the S and O 
roles, but the relative proportion of each with respect to each other remains the 
same. That is, the PAS pattern is clear: the O role contains the highest proportion 
of lexical arguments, followed by the S role and finally the A role. 

The ZAI data also adhere to the two quantity constraints, the One Lexical Ar- 


gument constraint and the One New Argument constraint. This is illustrated in 
Table 3.5. 


Table 3.5: Percent of transitive clauses with 0, 1, and 2 lexical arguments 
in ZAI 


0 1 2 


percent lexical arguments 22% (33/149) 66% (98/149) 12% (18/149) 


Only 18 of the 149 total transitive clauses (12%) have more than one lexical 
argument. There are no clauses in the corpus which contain more than one new 
argument. 

Finally, with respect to new mentions, a new referent is introduced in A posi- 
tion only twice in the corpus, thus violating the Given A constraint on only two 
occasions. This is shown in Table 3.6: 


Table 3.6: Proportion of new arguments per grammatical role in ZAI 


A S O 


percent new arguments 1% (2/149) 11% (18/165) 21% (35/168) 


In short, we have seen thus far that the ZAI data patterns as predicted by PAS: 
lexical and new arguments are avoided in A position and the number of clauses 
with more than one lexical argument or new argument are very few. Because 
the number of new referents introduced and the number of clauses used by each 
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speaker will no doubt vary from speaker to speaker depending on factors such 
as genre or topic, one important issue related to the frequency of lexical and new 
As is what Du Bois terms “information pressure”: 


When a number of new protagonists are introduced within the space of a 
few clauses, the information pressure is higher than when fewer protagonists 
are introduced in the same number of clauses-or when the same number of 
protagonists are introduced in a longer sequence of clauses. (Du Bois 1987: 
834 (italics mine)) 


As Du Bois notes, the issue is especially relevant because in texts with low 
information pressure, few new or lexical mentions are likely in any grammatical 
role. Conversely, in texts with high information pressure, many new or lexical 
mentions are likely in any role. In this corpus, clauses with no lexical arguments 
are much less frequent than clauses with one lexical argument, as was shown in 
Table 3.5. 

Itis an open question, of course, whether this is a generalizable fact about ZAI 
narrative discourse. If we calculate the "Information Pressure Quotient” (IPQ) for 
the ZAI data, defined as the total number of new mentions divided by the total 
number of clauses, we end up with an IPQ of 0.159 (55/346). This IPQ is very 
similar to the one reported by Du Bois (1987: 834) for Sakapultek Maya, which 
translates to approximately one new introduction every 6.5 clauses. More likely, 
however, given the variation in the number of clauses per individual narrative 
(as high as 74 for one speaker and as low as 24 for another), the degree of in- 
formation pressure will differ depending on factors such as the genre, the topic, 
and the individual speaker. We would expect a different corpus with a different 
degree of information pressure to show a different proportion of clauses with 
one or zero lexical arguments. Crucially though, due to the two Quantity con- 
straints, we would not expect a higher proportion of clauses with two lexical or 
new arguments. 

Based on the quantitative data reviewed thus far and summarized in Tables 3.2- 
3.6, it appears that ZAI speakers conform closely to the PAS constraints proposed 
by Du Bois. But given the amount of cross-linguistic data that has been collected 
in support of the same discourse tendencies (see Table 3.3 as well as the studies 
in Du Bois et al. (2003), this does not come as a surprise. The question I would 
like to pursue in the next section is: Why? 
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3.1.3 PAS and the notion of Accessibility 


One of the important insights of PAS, then, has been that there is a cross-linguis- 
tic tendency for new and lexical arguments to avoid the A role and to appear 
most consistently in the S and O roles. Conversely, there is a tendency for old or 
given arguments to occur more commonly in the A and S roles. 

The question of what the underlying mechanisms are that might be respon- 
sible for the PAS patterns observed cross-linguistically is formulated succinctly 
by Haspelmath (2006: 910). He argues that while the majority of the research 
supporting PAS assumes the constraints in Table 3.1 as given, few of the existing 
studies question whether those constraints do not ultimately reflect other, more 
basic linguistic and cognitive mechanisms underlying discourse. 

Haspelmath points out two main issues with PAS. Most critically, he shows 
that there is a very close relationship between the constraints referring to lexical 
arguments and those referring to new arguments: new arguments tend to be coded 
with full lexical forms (a connection that was also noted by Du Bois 1987: 829-830). 
In Haspelmath's view, then, the four constraints could potentially be reduced to 
just one Quantity constraint and one Role constraint. 

Second, Hasplemath raises the important question of whether the Quantity 
tendencies do not follow straightforwardly from the Role tendencies. That is, if 
speakers avoid new and/or lexical As, they automatically avoid clauses with two 
new or lexical core arguments, because there are maximally two core arguments 
(A and O). Based on this, Haspelmath (2006: 911) suggests that “it may well be that 
the quantity maxims can be dispensed with, that is, the universally observable 
quantity tendencies are reducible to whatever explains the [Given A and Non- 
lexical A constraints]". 

So, what might explain the Given A and Non-lexical A constraints? These two 
constraints can arguably be based on the strong correlation between the A role, 
animacy and topicality. Because animates tend to be topical, and topical entities 
tend to be coded with non-lexical forms, the two constraints can be shown to be 
the result of more fundamental properties of discourse, without the need for any 
independent maxims. 

This is one ofthe main impulses behind a study by Everett (2009), who takes up 
Haspelmath's main criticisms and argues in favor of an explanation of the deeper 
generalizations behind the four PAS constraints. In particular, he argues, based 
on data from English and Portuguese, that the inherent tendency for the A role 
to be dissociated with lexical and new mentions is motivated by the tendency of 
the A role to be filled by human referents, which are inherently more topical, and 
for the S and O roles to be filled by non-human referents which are less topical. 
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The data in Table 3.7 show that the same holds for the ZAI data, at least as far as 
the A and O roles are concerned. 


Table 3.7: Percent human referents per core grammatical role 


A role S role O role 


percent human 99% (147/149) 88% (146/165) 32% (53/168) 


Although the percentage of human referents in the S role is very high, the 
point made by Everett still holds: As tend to be topical and represented anaphor- 
ically since they typically represent humans. Os should tend to be new and 
represented more frequently by lexical arguments since they typically refer to 
non-humans, which are generally non-topical. Ss represent a middle ground in 
that they present relatively less human referents than As (and therefore more 
lexical and new arguments), but more than Os. In other words, for Everett, the 
observed patterns in the proportion of lexical As, Ss, and Os can be reduced to 
the factor of human-ness." 

Here, I build on the arguments made by Haspelmath (2006) and Everett (2009) 
(as well as Haig & Schnell 2016 and Brickell & Schnell 2017) and claim that the 
underlying reasons for the PAS patterns observed cross-linguistically are related 
to basic discourse-functional factors, such as topicality and animacy. In contrast 
to those authors, I propose a different mechanism responsible for the PAS pat- 
terns, that is, that the fundamental mechanism driving the avoidance of new and 
lexical As in discourse can be shown to be one of accessibility (Ariel 1990; 2001). 
According to the view developed here, the fact that lexical and new referents 
tend to correlate with grammatical roles in certain predictable ways is due to the 
degree of accessibility of the referents that appear in the respective grammatical 
roles. 

In the rest of this chapter, I explore the idea that, because new referents are (al- 
most) always coded using lexical arguments, these tendencies can be accounted 
for using Ariel's scalar notion of accessibility (Ariel 1990; 2001): As tend to be 
highly topical and hence highly accessible and thus rarely new and rarely coded 
with full lexical forms; Os tend to be relatively non-topical and hence inaccessi- 
ble, frequently the locus of introduction for new referents, and thus often coded 


5In Everett's words, “Humans like to talk about humans” (Everett 2009: 21). 
"See also Haig & Schnell (2016) and Brickell & Schnell (2017) for further empirical, cross- 
linguisitic study and discussion in this direction. 
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using full lexical forms; Ss, frequently topical but also often the stage for new 
referents, form somewhat of a middle ground. 

Ariel (1990; 2001)'s scalar notion of accessibility is based on the premise that 
a nominal expression is best characterized as an instruction for the addressee to 
retrieve a piece of information from either the physical world or the discourse 
context by indicating how accessible or salient the particular piece ofinformation 
is to the addressee at that particular point in the discourse. From the perspective 
of accessibility, nominal expressions are actually accessibility markers” (Ariel 
2001: 31). 

How do nominal expressions indicate different degrees of accessibility? Ariel 
(2001: 32) claims that “the more informative, rigid, and unattenuated an expres- 
sion is, the lower the degree of accessibility it codes, and vice versa, the less infor- 
mative, rigid, and more attenuated an expression is, the higher the degree of ac- 
cessibility it codes”. In other words, different nominal expressions have different 
discourse functions because they are marked for different degrees of accessibility: 
less attenuated nominal expressions such as LNPs signal less highly accessible 
or less salient referents, while attenuated expressions such as pronouns or zeros 
signal more highly accessible or more salient referents. 

The possible link between Du Bois” theory of PAS and Ariel's Accessibility 
theory has been mentioned sporadically by the authors themselves, but to my 
mind has not been sufficiently developed. For example, Ariel (2001: 67) states: 


If the motivation [Du Bois] proposes for ergative and accusative markings is 
based on the lexical versus nonlexical distinction, then it is probably based 
on the consistently high degree of accessibility of agents versus the inconsis- 
tent degree of accessibility associated with intransitive subjects and objects, 
rather than on the given-new distinction between them. 


In later work, Du Bois (2006: 194) remarks that he "started thinking about PAS 
in terms of accessibility theory and, more specifically, the notion of topicality or 
salience in terms of high versus low accessibility" To my knowledge, however, 
this claim has not yet been forcefully stated in the literature. No detailed studies 
exist which explore the possibility that the deeper generalization behind the dis- 
tribution of new and lexical arguments in the A versus the S and O roles is this: 
accessibility and the cognitive costs associated with different types of nominal 
expressions. 

One goal, then, is to draw a firm connection between the degree of accessi- 
bility, the forms of nominal expressions, and the three core grammatical roles, 
S, A, and O. In short, the link between PAS and Ariel's notion of accessibility is 
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this: the O role tends to house low accessible referents that are coded with more 
linguistic material, such as LNPs. The A role tends to house highly accessible ref- 
erents that are coded with less linguistic material, such as zeros. The tendencies 
for the S role will be found somewhere between these two poles, tending more 
towards the O role in the marking of new information, but more towards the A 
role in contexts of topic continuity, i.e. the marking of topical or human elements. 
Therefore, I propose that the PAS tendencies can be represented graphically in 
terms of accessibility in the following way: 


(1) Accessibility and PAS 
LNP O 
( S 


Subject enclitic A 


low accessibility «= high accessibility 


Importantly, Ariel emphasizes that often more than one factor acts simultane- 
ously to affect the degree of accessibility- and thus the form- of nominal expres- 
sions. Several of the main factors involved are listed in (2): 


(2) Main factors involved in assessing the degree of accessibility? (Ariel 1990; 
2001) 

Number of previous mentions, i.e. number of new vs. old mentions 

Grammatical role, i.e. subject versus non-subject 

Animacy 

Degree of discourse salience or topicality, i.e. topics vs. non-topics 

Recency of mention 

Paragraph and frame boundaries, i.e. paragraph-initial positions 

such as episode boundaries 


moan E? 


Ihave already discussed several of these factors: two (number of new mentions 
and grammatical role) are directly mentioned in the PAS constraints, and two 


"This list is not an exhaustive one. For example, Ariel (2001: 50) emphasizes the role that pho- 
netic and intonational cues might play in marking the degree of accessibility of a referent. She 
mentions Mithun (1995) who shows how the same accessibility marker, a definite NP, can en- 
code different degrees of accessibility through prosodic cues: low degrees of accessibility are 
encoded by definite NPs which occur in separate intonation units, slightly higher degrees of 
accessibility are encoded by definite NPs which are not separated by any intonational cues, 
and high degrees of accessibility are encoded by definite NPs that occur in the more given 
syntactic position (in Central Pomo) with a specific, unmarked intonation. 
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(animacy and topicality) are factors that have been suggested to be fundamental 
in motivating those constraints (Haspelmath 2006; Everett 2009). The remaining 
two factors (recency of mention and episode boundaries) are taken up in 83.1.6 
and §3.1.8. 

In the remainder of the chapter, I analyze these accessibility factors with re- 
spect to the ZAI data and show that all of the factors, subsumed under the no- 
tion of accessibility, not only condition the forms of nominal expressions but 
also restrict their distribution to specific grammatical roles. I explore the extent 
to which the gradable notion of accessibility can be shown to underlie the PAS 
patterns in ZAI, by asking the following questions: 


e What types of accessibility markers occur in the corpus in each of the three 
grammatical roles? 


e What are the main accessibility factors involved in determining the distri- 
bution of nominal expressions across the three roles? 


* To what extent can the notion of accessibility, as a notion that encompasses 
at least the factors listed above in (2), sufficiently account for the patterns 
found in the ZAI data? 


To answer these questions, each argument in the Pear Story corpus was coded 
for the following five factors: 


(3) Coding scheme 

Form of reference: lexical, pronominal, or zero 
Core grammatical role: S, A, or O 

Animacy: human vs. non-human 


aos Pf 


Level of salience: New, Previous subject, Active, Old (see (4) for 
details) 
e. Appearance at episode boundaries 


This coding scheme includes each of the accessibility factors listed in (2). It is 
based on the coding scheme used by Arnold (2003) in her study of constraints 
on reference form in Mapudungan, but it differs in my formulation of the cate- 
gory Active (see (4) below) and in the inclusion of two categories: animacy and 
appearance at episode boundaries. To simplify the quantitative analysis, only 
matrix clauses were included in counting the number of referents that occurred 
in each of the three roles. Since one focus of this study is the distribution of 
zero versus overt third person reference forms, I did not want to include cases 
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where either type of mention was disallowed. A more detailed identification of 
the conditions under which one or other form is used is discussed in $4. For 
the purposes of the PAS study, however, subordinate and relative clauses, which 
were very infrequent, were excluded. Finally, given the special nature of “presen- 
tational” or “sentence focus” constructions (“out of the blue” constructions; cf. 
§6.1.1; §5.1.2) that typically appear at the beginnings of narratives, they have also 
been excluded. In the majority of cases, the speakers began the narrative with a 
transitive clause containing a LNP in both the A and the O role (e.g. cuchuugube 
ti rigola pera ‘A man is/was picking pears’). Since these types of constructions 
were not found in other parts of the Pear Story corpus, they are excluded from 
the analysis (except, of course, in the relevant sections dealing with the intro- 
duction of new referents) as they would otherwise have inaccurately biased the 
data. 


3.1.4. Accessibility and the introduction of new referents 


In ZAL singular indefinite referents are typically introduced using ti “one” fol- 
lowed by a noun phrase, as in ti xcuidi 'INDEF + child” or ti badunguiiu 'INDEF + 
man”. Plural indefinite referents are introduced with a quantifier such as cadxi 
'some' as in cadxi cuananaxhi 'some fruit'. Referents may also be introduced as a 
bare (uncountable) noun bicicleta ‘bicycle’ or within a possessor phrase such as 
lari stibe ‘his shirt’ (cloth + poss=3sc). 

Since new referents are referents that have not previously been introduced to 
the discourse, we would expect them to be referred to with the lowest accessiblity 
markers, lexical NPs (LNP). This is indeed the case, as is shown in Table 3.8. 


Table 3.8: Distribution of new mentions (all referents) by grammatical 


role 
A role S role O role Total 

Lexical NP 4% (2/55) 33% (18/55) 64% (35/55) 100% (55/55) 
(LNP) 
Dependent 0 0 0 0 
pronoun (DPR) 
Independent 0 0 0 0 
pronoun (IPR) 


53 


Preferred Argument Structure and the pragmatic status of nominal forms 


All new referents are introduced with a lexical NP. The main tendency is for 
indefinite NPs of the type ti badunguiiu 'INDEF + man' to be used to mark previ- 
ously “unidentifiable” and subsequently "activated" referents (Lambrecht 1994). 
This occurs in 58% (32/55) of the cases. In the remaining cases (42% (23/55)), NPs 
preceded by a quantifier, such as chonna badunguiiuhuiini “three + boys”, or bare 
NPs, such as pera “pear”, are used. 

As is predicted by PAS, the majority of new referents are introduced in the O 
role, followed by the S role, while only two new referents in the entire corpus are 
introduced in the A role. This pattern is expected, as is predicted by the graphic 
in (1): high accessibility markers such as LNPs tend to occur in the O role while 
low accessibility markers such as pronouns tend to occur in the A role. 

Interestingly, this pattern becomes skewed somewhat if we introduce the fac- 
tor of animacy and consider only the introduction of human referents. This is 
shown in Table 3.9. 


Table 3.9: Distribution of new mentions (human referents) by gram- 
matical role 


A role S role O role Total 


LNP 7% (2/28) 54% (15/28) 39% (11/28) 100% (28/28) 


When we factor in animacy, the proportion of new referents introduced in 
each role changes: now, the majority of new human referents are introduced in 
the S role, followed by the O role, and to a much lesser extent, the A role. The 
pattern found in Table 3.9 is due to the fact that human participants tend to 
be more salient and, hence, more accessible than non-human referents, which 
allows them to be introduced at a higher rate in the S role. 

Furthermore, a referent that is introduced in the S role, as opposed to the O role, 
marks that referent as subsequently more accessible.” This is perhaps most visible 
when we consider the types of human referents that were introduced in each role. 
For example, the most salient human participant in the Pear story around whom 
the majority of the action occurs is the bike boy, who was introduced exclusively 
in the S role. Meanwhile, the least salient human participant, the Bike girl, was 
introduced exclusively in the O role. 


"Du Bois (1987: 831) argues that the S role acts as a cognitive "staging area”. I come back to this 
idea below. 
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3.1.5 Accessibility and co-reference 


There are significant differences between the forms speakers use to introduce 
referents and the forms they use to track the referents through the narrative. 
Whereas new referents are always introduced using LNPs, the array of nom- 
inal forms available for coding non-new referents is wider. In this section, I 
present data showing that the nominal expressions ZAI speakers employ cor- 
relate with the accessibility factors of animacy, topicality, recency of mention, 
episode boundaries and, crucially, with grammatical role.!° The reason that spe- 
cific types of nominal forms strongly tend to occur in certain core argument 
positions is because they mark specific levels of accessibility. In particular, we 
find that low accessibility markers tend to avoid the A role and to occur most 
regularly in the O role, conversely, that high accessibility markers tend to avoid 
the O role and to occur most regularly in the A role. The S role, in contrast, tends 
to house high accessibility markers in contexts of topic continuity and low ac- 
cessibility markers in contexts of new or marked information. 

In the tracking of already-introduced referents, ZAI speakers have two basic 
anaphoric strategies available: lexical noun phrases (plus a demonstrative) and 
pronouns (see §3.1.7 for discussion). One of four demonstrative forms may appear 
after either a singular or a plural noun. The four-way distinction between prox- 
imal (for objects near to the speaker), mesioproximal (for objects near to the ad- 
dressee), mesiodistal (for objects away from both of both speaker and addressee 
but rather near), and distal (for objects far away from both) demonstratives is 
shown in Table 3.10: 


Table 3.10: ZAI demonstratives 


proximal ri? 
mesioproximal ca 
mesiodistal rica’ 
distal que 


Plural noun phrases are additionally marked using the plural marker ca as in 
ca badunguiiu que ‘those boys’ (PL + boy + DIST) or with a quantifier, as in chonna 
badunguiiu que ‘those three boys’ (three + boy + DIST). 


Tt is important to note, however, that although Ariel considers grammatical role a factor in 
accessibility marking (see (2b)), she does not make the distinction between subject of transitive 
verbs (A) and subjects of intransitive verbs (S). However, I believe that this distinction is critical 
in assessing degrees of accessibility, as we will see below. 
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Table 3.11 shows the distribution of each type of form per grammatical role. 


Table 3.11: Frequency of forms used for co-reference: LNPs + Demon- 
strative vs. Pronouns 


A S O Total 


LNP + DEM 12% (17/146) 23% (34/146) 65% (95/146) 100% (146/146) 
Pronouns 46% (130/281) 40% (113/281) 14% (38/281) 100% (281/281) 


Here we see that when we exclude new referents from the count, referents 
encoded with LNP + Demonstrative, i.e. a low accessibility marker, still occur 
most often in the O role (65%) and least often in the A role (12%). Within these, the 
proximal form is used only twice, the medial form only once, and the distal form 
never. The distal demonstrative is by far the most frequent. Also, as we would also 
expect, referents encoded with pronouns, i.e. high accessibility markers, occur 
most often in the A role (46%) and least often in the O role (14%). 

One additional piece of data worth commenting on here is the differential rate 
of lexical mention between the A and S roles that emerges in Table 3.11. It ap- 
pears that transitive subjects (As) are half as likely to be coded using a LNP than 
are intransitive subjects (Ss). As we saw in §3.1.2, Du Bois (1987) attributes this 
tendency to the One Lexical Argument Constraint (the tendency to use only one 
lexical argument per clause). According to Du Bois, this tendency was, in turn, 
due to the fact that As tend to be “given” or salient more often than Ss, resulting 
in a lower rate of lexical reference. As Arnold (2003: 237) argues, if this were the 
case, that if we hold salience constant, we would expect similar rates of lexical 
reference for A and S. Table 3.12 appears to show that this is not the case. The 
categories of salience we distinguish here are (in order of low to high accessibil- 
ity): New, Old, Active, and Previous Subject (further review and description of 
these categories will be covered in the next section, 3.1.6). 


Table 3.12: Lexical arguments for A and S at each level of salience 


New Old Act PrS 


A 100% (2/2) 43% (6/14) 21% (5/24) 6% (6/109) 
S 100% (18/18) 74% (23/31) 27% (6/22) 5% (5/94) 
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Here, the A role contains a significantly lower rate of lexical reference for 
the level of salience categorized as “Old” and a slightly lower rate for the level 
“Active”. Therefore, when salience is held constant, LNPs are still used more for S 
than for A. For Arnold, this is evidence that the One Lexical Argument Constraint 
cannot be explained based on discourse factors such as topicality. 

From the perspective of accessibility, however, this is not necessarily true. One 
of the reasons that the high rate of lexical arguments in the S role in “Old” con- 
texts is that more than 40% (10 out of 23) of the tokens are used to refer to non- 
human referents. In contrast, only 17% (1 out of 6) of the lexical arguments in the 
A role in “Old” contexts are used to refer to non-human referents. The data in 
Table 3.12 thus ignore the tendency for human referents to be more salient and, 
therefore, more likely to be transitive agents (i.e. the potentiality of agency scale 
Silverstein (1976) than non-human referents. For this reason, I suspect that the 
different rates of lexical arguments for S than for A are not due to the One Lexical 
Argument Constraint, as Arnold (2003) claims, but to the independent tendency 
for the A role to house human, highly salient and, therefore, highly accessible 
referents. 

At this point, it should be clear from this discussion as well as from Table 3.7 
(83.1.3) and Table 3.9 (83.1.4) that animacy strongly influences accessibility and, 
hence, the distribution of nominal expressions per grammatical role. In what 
follows, I examine the categories of full lexical noun phrases (LNP) and pronouns 
in more detail with respect to two additional accessibility factors, topicality and 
recency of mention (both captured under the label “salience”). 


3.1.6 LNPs and salience 


We would expect the two accessibility factors of topicality and recency of men- 
tion to correlate in predictable ways with the occurrence of LNPs. The effects of 
these two factors in the ZAI data can be observed through the use of the coding 
scheme for salience described in (4). 

I use the term salience here in the same sense as Arnold (2003) since it effec- 
tively combines two of the factors in (2), recency of mention and topicality. The 
result is a four level scale: 


(4) Salience of discourse referents (adapted from Arnold 2003: 231) 


New = New: referent is brand new to the text. 
Old - Old: referent had appeared previously in the text, but not in the 
previous three clauses. 
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Act - Active: referent was last mentioned as either the object of the 
previous three clauses, as a subpart of the subject or object in the 
previous three clauses, or both subject and object of the previous 
three clauses together.” 

PrS = Previous subject: referent mentioned as subject of the previous 
clause. 


This scheme allows us to observe how referential forms can be simultaneously 
affected by several discourse constraints. In particular, distinguishing between 
these four levels in this way allows us to measure differences in salience be- 
tween both recency of mention (by comparing "Previous Subject" with "Active" 
and "Old") and topicality (by comparing "Previous Subject" with "Active"). I thus 
assume salience to be a gradable scale (Hopper & Thompson 1980) - where ref- 
erents can be more or less salient- and for the relative values on this scale to 
coincide directly with those on the scale of accessibility - where referents can 
be more or less accessible. 

First, with respect to recency of mention, reference to something in the previ- 
ous three clauses (“PrS” and “Act”) is less likely to be encoded with a LNP than 
reference to something prior to those three clauses (“New” and “Old”). This is 
shown in Table 3.13. 


Table 3.13: Percent of LNPs and recency of mention 


Reference to: ?; lexical 


Entities in the previous clause or previous three clauses (PrS 26% (53/201) 
* Act) 
Entities prior to three clauses (New + Old) 74% (148/201) 


Of all the LNPs in the corpus, three times as many occurred in "New" and “Old” 
contexts than in “PrS” and “Act” contexts. In other words, more recent mentions 
are less likely to be coded with a LNP than are less recent mentions. 

Second, with respect to topicality, reference to a subject (A or S) inthe previous 
clause or in the previous three clauses (PrS) is less likely to be encoded with a LNP 


"This category allows for the distinction between the relative discourse prominence of an an- 
tecedent that was mentioned in subject position and an antecedent that was mentioned in a 
non-subject position (Arnold 2003: 226). I have decided to adjust this category slightly from 
Arnold (2003: 231)'s formulation to include the previous three clauses (and not only the pre- 
vious clause), because I think it more accurately describes the patterns observed in the data, 
particularly the distribution of pronouns and demonstratives. 
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than reference to a non-subject in any of the previous three clauses (Act). This 
is shown in Table 3.14. Of the LNPs in the corpus, three times as many occurred 


Table 3.14: Percent of LNPs and topicality 


Reference to: % lexical 


Subject (A or S) of the previous clause or previous three 28% (15/53) 


clauses (PrS) 
Non-subject in any of the previous three clauses (Act) 72% (38/53) 


in "Act" contexts than in "PrS" contexts. That is, more topical referents are less 
likely to be coded with a LNP than are less topical referents. 

Based on these correlations as well as those we have set up between low de- 
grees of accessibility, LNPs and the O role on one hand and high degrees of 
accessibility, pronouns and the A role on the other, we would expect recency 
of mention and topicality to also correlate with grammatical role in the follow- 
ing way: referents that occur in the O role will be less topical and less recent 
(and coded as “New” or “Old”) while referents that occur the A role will be more 
topical and recent (and coded as "Previous Subject"). Table 3.15 shows that this 
pattern indeed holds for the ZAI data. 


Table 3.15: Frequency of referents in each category of salience 


A S O 


New 1% (2/149) 11% (18/165) 21% (35/168) 
Old 9% (14/149) 19% (31/165) 44% (74/168) 
Act 16% (24/149) 13% (22/165) 30% (50/168) 

( 
( 


PrS 74% (109/149) 57% (94/165) 5% (9/168) 
Total 100% (149/149) 100% (165/165) 100% (168/168) 


Conversely, we would also expect the majority of less topical and less recent ar- 
guments, such as those found in “New” and “Old” contexts, to occur in the O role 
and for the majority of more topical and more recent arguments, such as those 
found in “Previous Subject” contexts, to occur in the A role. This is also what we 
find, as shown in Table 3.16. The A role appears specialized for more topical and 
more recent mentions, while the O role is more specialized for mentions that are 


less topical and less recent. 
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Table 3.16: Frequency of referents in each category of salience 


A S O Total 


New 4% (2/55) 33% (18/55) 63% (35/55) 100% (55/55) 
Old 12% (14/119) 26% (31/119) 62% (74/119) 100% (119/119) 
Act 25% (24/96) 23% (22/96) 52%(50/96) 100% (96/96) 
PrS 51% (109/212) 44% (94/212) 5% (9/212) 100% (212/212) 


Finally, we would predict the tendencies shown in Tables 3.15 and 3.16 to corre- 
late with particular types of nominal expressions. That is, we would predict low 
accessibility markers such as LNPs to occur most often in contexts categorized 
as “New” and “Old” and high accessibility markers such as pronouns to occur 
most often in “Previous Subject” contexts. As Table 3.17 shows, this is also what 


we find. 


Table 3.17: Type of nominal expression per category of salience 


New Old Act PrS 


LNP + DEM 100% (55/55) 88% (93/106) 53% (38/68) 79. (15/202) 
PRONOUNS 0% (0/55) 12% (13/106) 47% (30/68) 93% (187/202) 
( 


Total 100% (55/55) 100% (106/106) 100% (68/68) 100% (202/202) 


The inverse relation that exists between degrees of salience (defined in terms 
of topicality and recency of mention) and rates of LNPs should be clear: a high 
degree of salience and accessibility correlates with a low rate of LNPs and a low 
degree of salience and accessibility correlates with a high rate of LNPs. Further, 
the relation should also be clear between high rates of LNPs and the O role as 
well as low rates of LNPs and the A role. In the next section, I analyze the re- 
lation between degrees of salience and the distribution of higher accessibility 
expressions, i.e. pronouns. 


3.1.7 Pronouns and salience 


The ZAI pronominal system is summarized in Table 3.18. This system does not 
distinguish between masculine and feminine, or between formal and informal. 
The third person pronoun differentiates between human, animal, and inanimate. 
In addition, first person plural distinguishes between inclusive and exclusive. 
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Table 3.18: The ZAI pronominal system 


Dependent form Independent form 


1sG -a naa 

2sG -lu' lii 

3sG.HUM  -be, -Ø laa-be, laa-D 
3sG.ANIM  -me, -Ø laa-me, laa-D 
3sGINAN -ni, -Ø laa-ni, ni, laa-Ø 
IPLINCL -nu laa-nu 

1PL.EXCL -du laa-du 

2PL -tu laa-tu 

3PL.HUM  -ca-be, -ca-Ø laa-ca-be, laa-ca-D 
3PL.ANIM  -ca-me, -ca-D laa-ca-me, laa-ca-D 
3PLINAN  -ca-ni, -ca-D laa-ca-ni, -cani, laa-ca-D 


Although NPs are not marked for case in ZAL pronouns do have independent 
and dependent forms that are sensitive to their grammatical position within the 
clause. Dependent forms occur immediately after the verb or noun. In all other 
positions, the independent form is used which is comprised of a base form laa 
plus the dependent pronoun. For example, the third person singular pronoun can 
be realized as an overt form or as a zero form and, when used in object position, 
before the verb, or in isolation, the pronoun base laa carries the dependent pro- 
noun. The dependent forms mark already activated referents, i.e. they mark 
continuing topics. These forms do not mark the same contrasts as the indepen- 
dent forms, which can function as either topical or focal expressions. In a canon- 
ical verb-inital clause, pronouns in the S and A roles appear in the dependent 
form as enclitics on the verb. Pronouns in the O role occur in the independent 
form after the subject. 

In the remainder of this section, I focus on two main distinctions that appear 
in (3.1.7). First, I compare the distribution in the Pear Story corpus of the overt 
third-person singular dependent form, -be, to that of the zero form, =Ø. Second, 
I analyze the distribution of dependent pronouns versus independent pronouns. 


“The option to use an independent form for the A or S role, as in the case of “marked topics”, 
does exist. These cases will be discussed in more detail below. 
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3.1.7.1 Distribution of third-person dependent pronouns: overt vs. zero 


In simple intransitive (5 — 6) or simple transitive constructions (7 - 8), the choice 
between the overt or the zero form of the pronominal subject clitic is free (Marlett 
& Pickett 1996): 


(5) biababe láyu 
bi-aba-bel! layu 
COMPL-fall-3sc ground 
'S/he fell on the ground: 


(6) biaba layu 
bi-aba=® layú 
COMPL-fall-3sc ground 
'S/he fell on the ground: 


(7) biiyabe bá'du qué 
bi-iya-belH ^ ba'du que! 
COMPL-see=3sG child DIST 
“S/he saw the child. 


(8) biiya ba'du qué 
bi-iya=@ ba'du' quel 
COMPL-see=3sG child DIST 
“S/he saw the child’ 


The intransitive clauses in (5) and (6) convey the same propositional content. 
However, whereas in (5) the S role is occupied by the overt third person pronoun 
=be, in (6) the S role is occupied by the zero form. This alternation is possible in 
transitive clauses as well, as is shown in (7), which contains the overt form, and 
(8), which contains the zero form. 

If the choice between the two forms is indeed free at the level of the main 
clause, it is important to consider the discourse conditions are under which each 
of the two forms is used. One possibility is that the distribution of the forms is 
conditioned by grammatical role. This possibility is explored in Table 3.19. 

What emerges from this table is a strong preference for overt marking. How- 
ever, although there may be a slight preference for the overt form to appear in 
the A role, there does not seem to be a significant difference between the two 
forms in the grammatical role with which they are associated. 
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Table 3.19: Frequency of third-person singular overt vs. zero DPR per 
grammatical role 


A S 


=bé 78% (73/93) 73 % (58/79) 
=D 22% (20/93) 277 (21/79) 


Total 100% (93/93) 100% (79/79) 


Table 3.20: Frequency of third-person singular overt vs. zero for each 
level of salience 


New Old Act PrS 
=bé 0 73% (8/11) 90% (18/20) 74% (106/144) 
=Ø 0 27%(3/11) 10% (2/20) 26% (38/144) 
Total 0 100% (11/11) 100% (20/20) 100% (144/144) 


A second possibility is that the distribution of the overt versus the zero form 
correlates with one or more levels of salience. This is represented in Table 3.20. 

These data show that zero forms are much more restricted in terms of the 
degree of salience compared to the overt forms. That is, while overt pronouns 
may occur somewhat freely at each level of salience (except, of course, for "New"), 
zero pronouns appear to be much more restricted to “PrS” contexts- there are 
only five total uses of the zero form outside of “PrS” contexts. 

Here for “Old” and *PrS” the distribution is very similar to Table 3.19 (in PrS 
it is basically identical). Only the numbers reported for “Act” stand out. This 
pattern would appear to imply that topicality and not recency of mention is the 
crucial factor in whether the zero form is employed. That is, the use of the zero 
form, higher on the accessibility scale than the overt form, is restricted to highly 
topical referents, whereas the overt form may be used for either highly topical 
or recently mentioned referents. For the purposes of this section, I leave this 
question unresolved for now and return to it in Chapter 4, where I argue that 
the distribution of the two forms is related to a distinction between primary and 
secondary topic. 
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3.1.7.2 Independent pronouns in the A or S role 


While the most common way to refer to subjects in the A or S role is through 
the use of dependent pronouns, it is also possible in ZAI to use an independent 
pronoun in pre-verbal position. These are cases that Du Bois terms "marked top- 
ics”: “NPs which are topicalized and set off in a separate intonation unit without 
a verb, and usually precede a predication about the same referent in the immedi- 
ately following clausal intonation unit" (Du Bois 1987: 814, note 11).? In the ZAI 
data, there are 25 instances in which an independent pronoun is used in this way. 


Consider the following example: 


(9 01 biabandabé 
bi-abanda=be!H 
COMPL-fall.hard-3sc 
‘He fell? 

02 biréeche dxümí  pérá  stí-bé 
bi-ree-H che-H. dxumil pera st H- peLH 
COMPL-spill basket pear  Poss-3sc 
‘His basket of pears spilled? 

03 laabe lá, 


laa-bel.H laH 
BASE-3SG LA 
“As for him; 

04 biiyadxisibé bádudxaapahuiini qué 
bi-iyadxisiH -peLH badudxaapa-huiini quel 
COMPL-see.fixedly-3sc girl-DIM DIST 


“He looked fixedly at the little girl? 


Here, the subject of the intransitive verb in the first intonation unit is the bike 
boy and the subject of the intransitive verb in the second intonation unit is the 
basket of pears. In the immediately following intonation unit, line 3, the third 
person singular independent pronoun is used to refer to the bike boy, followed 
by the particle La.!* The marked topic in the third line therefore helps to signal 


PImportantly, for the purposes of coding the data, marked topics were treated as one mention 
(of an independent pronoun), not two mentions (one independent pronoun plus one dependent 
pronoun). 

“The LA particle always appears at the end of an intonation unit. It appears in 23 of the 25 tokens 
in which the marked topic strategy is used. It also appears consistently at the end when-clauses 
and if-clauses. One possibility, then, is that it is used as a topic or contrastive topic marker. This 
issue will be taken up again in $6.2. 
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the change in subject from the basket back to the bike boy. The last intonation 
unit, line 4, consists of a transitive clause in which the A role is filled by the third 
person singular pronoun =be and the O role by a LNP that refers to the girl. Of 
the 25 instances in which this strategy is used in the corpus, 20 (or 80%) signal a 
change in subject from the previous sentence. 

Contrastive forms such as these are generally used in contexts where there 
is a switch in subject from the previous sentence because they signal referents 
that are not predicted to occur in particular roles. The account sketched here 
based on accessibility in fact predicts this to be the case. Ariel (2001: 37) states 
that, ^when an entity is not predicted to appear in a certain role, its degree of 
accessibility is (relatively) low” In other words, despite having the exact same 
form, marked topics with topicalized IPRs indicate a lower degree of accessibility 
(i.e. they signal a change in subject) than do IPRs in their more common O role 
position. 

To this point, I have tried to show that there exists a strong correlation in the 
ZAI data between nominal expressions such as LNPs, overt and zero dependent 
pronouns, and independent pronouns on the one hand, and certain grammati- 
cal roles (S, A, or O) on the other. Further, I have argued that the reasons for 
the strong correlation can be traced to different degrees of salience that are as- 
sociated with the grammatical roles in which the nominal expressions are used. 
Overt and zero dependent pronouns are preferred over LNPs in the S and A roles 
because those roles tend to house more salient referents. In contrast, independent 
pronouns and LNPs are preferred in the O role because of the tendency for the 
O role to house less salient referents. In the next section, I conclude this analysis 
by looking closely at one additional factor involved in the distribution of these 
nominal expressions across grammatical roles: episode boundaries. 


3.1.8 Episode boundaries 


Do speakers use different nominal forms according to different episode bound- 
aries? We can distinguish five main episode boundaries that each of the speakers 
marked in their narratives about the Pear film. These are listed in (10): 


(10) Five episode boundaries 
1 The Pear man is picking pears. 
2. The Bike boy passes by on his bike and steals a basket of pears. 
3. The Bike boy passes the Bike girl, hits a rock and falls. 
4. Three boys appear and help the boy get up and pick up the pears 
that spilled. 
5. The Three boys walk away, passing the Pear man by the pear tree 
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Out of the 35 episode boundaries in the seven narratives, 16 were marked with 
an intransitive clause and 19 with a transitive clause. 

Since low accessibility markers regularly occur in paragraph-initial positions 
such as episode boundaries (Ariel 2001: 52), we would expect clauses at episode 
boundaries to contain higher proportions of LNPs in the A and S roles than 
throughout the rest of the narratives. This is in fact the case. In Table 3.21, we 
see that the majority of the arguments (75%) that appear in the S role at episode 
boundaries are coded with a LNP. 


Table 3.21: Lexical arguments at episode boundaries, intransitive 
clauses 


new lexical S 75% (12/16) 
non-new lexical S 0 
non lexical S 25% (4/16) 


More significantly, all of the LNPs that occur in the S role at episode bound- 
aries introduce new referents. Moreover, of the 18 total new LNPs introduced 
in S position in the entire corpus, 12 (or 67%) occur at episode boundaries. This 
conforms to the observation by Du Bois (1987: 831) that the S position acts as a 
cognitive “staging area” for the introduction of referents that are later tracked 
through combinations of transitive and intransitive clauses. 

We also see a higher percentage of LNPs in transitive clauses at episode bound- 
aries. This is shown in Table 3.22. 


Table 3.22: Lexical arguments at episode boundaries, transitive clauses 


new lexical A non-new lexical A  non-lexical A 


new lexical O 0 1 11 
non-new lexical O 1 4 1 
non lexical O 1 0 0 


LNPs occur at a much lower rate in the A role than in the S role, even at episode 
boundaries. However, 7 of the 19 total As at episode boundaries are LNPs. This 
percentage (37%) is much higher than the percentage of lexical As found overall. 
In addition, it is interesting to note that of the two new lexical As that appear in 
the entire corpus, both occur at episode boundaries. 
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3.1 Preferred Argument Structure in ZAI 


In summary, LNPs in the A and S roles occur at a much higher rate at episode 
boundaries than they do at other parts of the narratives. I propose that the reason 
for this pattern can be also explained in terms of accessibility: episode boundaries 
are cross-linguistically very common sites for the occurrence of low accessibility 
markers (Ariel 2001: 52; see also Downing 1980). 


3.1.9 Summary 


The ZAI data patterns as predicted by PAS: lexical and new arguments are avoided 
in A position and the number of clauses with more than one lexical or new ar- 
gument is extremely rare. The question this chapter has been concerned with 
is: Why? Why should the four PAS constraints hold in ZAI, as well as cross- 
linguistically? How are they to be explained? Are the constraints discursively 
motivated? If so, what are these motivations? 

Other researchers (e.g. Haspelmath 2006; Everett 2009; Haig & Schnell 2016; 
Brickell & Schnell 2017) have pointed out, however, that the cross-linguistic ten- 
dency to observe these constraints may in fact be due to more fundamental gener- 
alizations about the nature of discourse. Three main observations stand out. First, 
there is a well-established correlation between human, topical referents and the 
A role in transitive clauses. Second, cross-linguistically what lexical arguments 
have in common with new arguments is that it is precisely full lexical forms that 
are used to introduce and track less-accessible (Ariel 1990) referents, i.e. new in- 
formation. This conforms to the more general observation in the literature that 
the use of more coding material, i.e. fuller nominal forms, correlates strongly 
with referents that are lower on the accessibility scale (Givón 1983). 

This chapter has presented discourse data from ZAI and has argued, in line 
with Haspelmath (2006), Everett (2009), Haig & Schnell (2016), and Brickell & 
Schnell (2017) that the constraints on new arguments and new As can be viewed 
as a subset of the constraints on lexical arguments and lexical As. I have proposed 
that the fundamental mechanism driving the tendencies captured by PAS can be 
traced to the notion of accessibility (Ariel 1990; 2001). This mechanism may be 
summarized as a reduction of the four PAS constraints to a single constraint that 
refers directly to the accessibility of referents in the A role: Avoid low-accessible 
As. In other words, the avoidance of new referents and LNPs in the A role can 
be understood as an avoidance of referents with a low degree of accessibility in 
that role. That this should be the case is natural given the factors involved in 
determining a referent's accessibility (as listed above in (2)): newly mentioned 
vs. already mentioned, non-subject vs. subject, animacy, topicality, recency of 
mention, and episode boundaries. 
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Highly accessible referents are referents that have already been mentioned, 
subjects, animate, topical, recently mentioned, and/or that do not tend to appear 
at episode boundaries. These are represented with relatively little coding mate- 
rial (Givón 1983). In contrast, low accessible referents are referents that are new 
mentions, non-subjects, inanimate, non-topical, not recently mentioned, and/or 
that tend to appear at episode boundaries. These are represented with relatively 
more coding material. Most significantly, this correlates with grammatical role: 
while highly accessible referents are very likely to occur in the A role, low acces- 
sible referents are very unlikely to occur in the A role. The correlations between 
accessibility factors, nominal expressions and grammatical role are summarized 
in Table 3.23. 


Table 3.23: Accessibility scale for ZAI nominal expressions 


low accessibility high accessibility 
accessibility factors newly mentioned non-subject already mentioned subject 

inanimate animate 

non-topical topical 

not mentioned recently recently mentioned 

at episode boundary not at episode boundary 
type of referring INDEF + LNP > LNP + DEM > IPR > overt DPR > zero 
expression 

O S A 


grammatical role 


These patterns are corroborated in the ZAI data presented above. On the one 
hand, new and/or lexical arguments are low on the accessibility scale and tend to 
be referred to using the forms ‘INDEF + LNP’ and ‘LNP + DEM”. These occur most 
commonly in the O role. On the other hand, already introduced referents are 
high on the accessibility scale and tend to be referred to using more attenuated 
pronominal forms. These occur most commonly in the S or A role. 

Interestingly, independent pronouns occupy a kind of middle ground, since 
they are usually used to refer to objects which tend to be less accessible than 
subjects, but, as in the case of “marked topics”, they can also be used to refer to 
subjects that are relatively less accessible, i.e. subjects that are not particularly 
salient at a certain moment in the discourse and/or subjects that occur at episode 
boundaries. The function of this construction in these cases is one of topic pro- 
motion (this construction will be an important part of the discussion of topic 
relations in $6). 
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Similarly, the S role also has an intermediate function between the O and A 
role. The S role will often house previously mentioned, animate, salient, topical, 
and recent referents but, as we saw, it also frequently functions as a “cognitive 
staging area” for the introduction of new referents at episode boundaries. 

In the next section, I move away from the analysis of Preferred Argument 
Structure and accessibility to examine the relationship between nominal forms 
and the pragmatic status of referents. 


3.2 Nominal forms and the pragmatic status of referents 


As we have seen throughout the course of this chapter, the forms of nominal 
expressions that speakers use depend on the assumed cognitive status of the ref- 
erents, that is, on assumptions that a speaker can reasonably make regarding 
the addressee's knowledge and attention state in the specific context in which 
nominal expressions are used (cf. Chafe 1976; Prince 1981; Ariel 1988; inter alia). 
Certain correlations therefore hold in ZAI between the formal category and the 
pragmatic status of the referents such that the lexical form of an NP may convey 
either: 1) a request to the hearer to act as if the NP were already pragmatically 
available or "given", albeit to varying degrees, or 2) a request to the hearer to 
act as if the NP constitutes unavailable or “new” information. The various nomi- 
nal forms in ZAI, namely independent and dependent pronouns, demonstratives 
and indefinite articles, indicate the status of their denotations as more or less 
activated in the speaker/hearer's mind, the discourse, or some real or possible 
world. 

Gundel et al. (1993) propose six cognitive (memory and attention) statuses rel- 
evant to the form of nominal expressions in discourse, which are implicationally 
related such that each status entails (and is therefore included by) all lower sta- 
tuses, but not vice versa. The statuses that an entity mentioned in a sentence may 
have in the mind of the addressee and their relation to each other is represented 
in the Givenness Hierarchy in Table 3.24: 


Table 3.24: Givenness Hierarchy (Gundel et. al.1993) 


in focus > activated > familiar > uniquely identifiable > referential > type identifiable 


“Depending on where the referents or corresponding meanings of these linguistic expressions 
are assumed to reside" (Gundel & Fretheim 2001: 177). 
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Each status on the hierarchy is a necessary and sufficient condition for the ap- 
propriate use of a different form or forms. In using a particular form, a speaker 
signals that s/he assumes the associated cognitive status is met and, since each 
status entails all lower statuses, s/he also signals that all lower statuses (the sta- 
tuses to the right) have been met (Gundel et al. 1993: 275). For example, any- 
thing in focus is also activated, anything activated is also familiar, and so on, but 
something that is familiar is not necessarily activated or in focus. The statuses 
are therefore ordered from most restrictive (in focus) to least restrictive (type 
identifiable), with respect to the set of possible referents they include. These are 


summarized in (11): 


(11) Six cognitive statuses proposed by Gundel et al. (1993) 
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Type identifiable. The addressee is able to access a representation of 
the type of object described by the expression. 

Referential. The addressee not only needs to access an appropriate 
type-representation, s/he must either retrieve an existing 
representation of the speaker's intended referent or construct a new 
representation by the time the sentence has been processed. 
Uniquely identifiable. In contrast to expressions which are 
referential but not uniquely identifiable, expressions which are both 
referential and uniquely identifiable require the addressee to 
construct or retrieve a representation on the basis of the nominal 
expression alone. Identifiability may be based on an already existing 
representation in the addressee's memory. 

Familiar. The addressee is able to uniquely identify the intended 
referent because he already has a representation of it in memory (in 
long-term memory if it has not been recently mentioned or 
perceived, or in short-term memory if it has). 

Activated. The referent is represented in current short-term memory. 
Activated representations may have been retrieved from long-term 
memory, or they may arise from the immediate linguistic or 
extralinguistic context. They therefore always include the speech 
participants themselves. 

In focus. The referent is not only in short-term memory, but is also 
at the current center of attention. Entities in focus generally include 
at least the topic of the preceding utterance, as well as any 
still-relevant higher-order topics. 


3.2 Nominal forms and the pragmatic status of referents 


The forms that encode statuses on the Givenness Hierarchy thus provide pro- 
cedural information about the manner of cognitive accessibility (or accessibility 
of representations of the intended referent) and thereby guide the addressee in 
restricting possible interpretations to ones whose status is explicitly indicated by 
particular forms. Furthermore, these hierarchical relations predict that a partic- 
ular form will be inappropriate if the required cognitive status is not met. 

Table 3.25 shows the correlations between pragmatic status and nominal forms 
in ZAIL 


Table 3.25: Correlations between linguistic form and pragmatic status 
in ZAI 


In focus Activated Familiar Uniquely Referential Type 


identifiable identifiable 
=bé independent NP + DIST ti NP ‘a NP’ 
=D pronoun ON 


NP + DEM 


Zero pronouns require that the referents be “in focus” while both dependent 
and independent pronouns require that referents be at least familiar. Indefinite 
NPs, in contrast, may require only that referents be type identifiable. 

The four-way distinction in demonstratives (proximal, mesioproximal, mesio- 
distal and distal) summarized in Table 3.10 is relevant here as well. As we saw, 
important differences occur in the Pear Story corpus with respect to how each 
demonstrative is used anaphorically to refer to already introduced referents. Of 
the 147 lexical NPs + DEM used this way, the proximal form ri’ is used only twice, 
the mesioproximal form ca only once and the mesiodistal form rica’ not at all. The 
distal demonstrative que is by far the most frequent, having been employed in the 
remaining 144 cases. What is interesting is that the few uses of the proximate and 
the mesioproximal demonstratives are limited to cases in which the lexical NP 
refers anaphorically to a referent mentioned within the previous three clauses, 
i.e more familiar or more activated referents. 

The above cognitive statuses generally correlate formally with type of nominal 
expression. As was shown, these statuses also have correlates in syntax, in par- 


‘Note that, based on further cross-linguistic investigation, Gundel et al. (2010) claim that: 1) if a 
language encodes the distinction between two adjacent statuses on the Givenness Hierarchy, it 
will also encode distinctions between higher statuses, and 2) all languages encode distinctions 
between the two highest statuses, ‘in focus’ and ‘activated’. 
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ticular, with the grammatical roles of core arguments. In short, the O role tends 
to house less activated or ‘new’ referents that are coded with more linguistic ma- 
terial such as Lexical NPs. The A role tends to house referents that are in focus 
(in the sense of Gundel et al. 1993) and that are coded with less linguistic material 
such as zeros. The tendencies for the S role are found somewhere between these 
two poles, tending more towards the O role in the marking of new information, 
but more towards the A role in contexts of topic continuity, i.e. the marking of 
topical or human elements. 

Finally, the cognitive status “in focus” has also been claimed to have prosodic 
correlates, i.e. phonological attenuation (Gundel et al. 1993: 285; but see also the 
cognitive category "activeness" in Lambrecht 1994; Ariel 1990; 2001). As men- 
tioned in 82.2.4 and discussed in more detail in $5, such correlates do not exist in 
ZAI, at least in the form of pitch accent. In this, it may be important to consider 
that, in Lambrecht's words: 


"While it is true that the referent of a pronominal expression or of a nom- 
inal expression spoken with attenuated pronunciation is always taken to 
be active..., it is NoT the case that an expression coding a referent which 
is assumed to be active is necessarily also spoken with attenuated pronun- 
ciation. In other words, weak prosodic manifestation is only a sufficient, 
not a necessary condition for assumed activeness of a discourse referent" 
(Lambrecht 1994: 97; EMPHASIS in original). 


For Lambrecht, then, the link between attenuated pronunciation and/or pro- 
nominal marking and highly activated referents represents the unmarked or de- 
fault case whereas, in more “marked” environments, these same referents may 
receive emphatic pronunciation and be coded using fuller nominal forms. 

Similarly, Ariel (2001: 50) emphasizes the role that phonetic and intonational 
cues might play in marking the degree of accessibility of a referent. She cites 
Mithun (1995) who shows how the same accessibility marker, a definite NP, can 
encode different degrees of accessibility through prosodic cues: low degrees of ac- 
cessibility are encoded by definite NPs which occur in separate intonation units, 
slightly higher degrees of accessibility are encoded by definite NPs which are not 
separated by any intonational cues, and high degrees of accessibility are encoded 
by definite NPs that occur in the more given syntactic position (in Central Pomo) 
with a specific, unmarked intonation. 

In the next chapter, I leave behind the relation between grammatical role, ac- 
cessibility and pragmatic status, which I will come back to in $6, and I continue 
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with the analysis of ZAI nominal forms by focusing on the alternation and dis- 
tribution of overt and zero third-person clitics that was mentioned in §3.1.7. 


3.3 Summary and conclusions 


This chapter explored the relationship in ZAI between form and distribution of 
nominals by function, focusing on the ways that the different forms are used 
to introduce and track referents and to mark referents as more or less accessible. 
Through the lens of Preferred Argument Structure (Du Bois 200323) and the theory 
of Accessibility (Ariel 2001), the chapter argued that the fundamental mechanism 
driving the PAS tendencies can be traced to the notion of accessibility. 

More specifically, one of the tendencies identified by PAS, the avoidance of 
new referents and lexical NPs in the A role, was understood as an avoidance 
of referents in the A role with a low degree of accessibility. More directly, the 
tendency is: Avoid low accessible As. This is because, as we saw, highly accessible 
referents with less coding material are more likely to occur in the A role. In 
contrast, low accessible referents with characteristically more coding material 
are unlikely to occur in that role and more consistently occur in the O role instead. 
The S role, for its part, exhibits a tendency in between the A and O roles. On the 
one hand, it can house previously mentioned, animate, salient, topical, and recent 
referents. On the other hand, it can house new referents at episode boundaries, 
thereby functioning as a “cognitive staging area” (cf. 83.1.8). 

In summary, the A role tends to house referents that are 'in focus' (Gundel et 
al. 1993) and coded with less linguistic material, and the O role houses referents 
that are less activated or “new” and coded with more linguistic material. The S 
role tends more towards the O role in contexts of marking new information and 
more towards A role in contexts of topic continuity. 

Furthermore, we saw that there is a relation between the grammatical role of 
core arguments, accessibility, and cognitive or pragmatic status. In other words, 
cognitive status correlates with type of nominal expression, as well as with the 
grammatical roles of core arguments. These correlations were summarized in 
Table 3.25. This occurs because nominal forms indicate the status of their denota- 
tions as more or less activated in the speaker or hearer's mind, as pragmatically 
more or less available, such that the forms of nominals that speakers use de- 
pend on the assumed cognitive status of the referents involved. That is, nominal 
forms depend on assumptions that a speaker can reasonably make regarding the 
addressee's knowledge and attention state in the specific context in which the 
form is used. 
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4 Nominal forms in discourse: the 
alternation of third-person singular 
pronouns 


As mentioned previously in §3.1.7, Table 3.18, third-person dependent and inde- 
pendent pronouns both alternate between an overt form (=be) and a zero form 
(=Ø). Because the choice between the overt and the zero form is free at the main 
clause level in both transitive and intransitive constructions, an explanation of 
the differential distribution between the two requires a more detailed syntactic 
and pragmatic analysis. This is the subject of this section, which begins with a 
discussion of the syntactic facts constraining the distribution of either pronom- 
inal form and then moves to an analysis of the discursive motivations involved 
in their use. In order to offer a more complete view, in addition to the Pear Story 
corpus, the analysis here also draws from previously published studies, from data 
collected using elicitation techniques, and from spontaneous dialogue. 


4.1 Syntactic constraints on the overt versus zero 
alternation 
The zero form has a more constrained syntactic distribution than the overt form, 


that is, the zero form has a narrower set of binding conditions. This can be ob- 
served in the case of reflexives and dependent clauses. 


4.1.1 Reflexives 


The reflexive consists of the word laaca 'same' followed by an independent pro- 
noun co-indexed with its antecedent. The zero pronoun is bound by a full NP 
antecedent (1) or another zero pronoun (2): 


Nominal forms in discourse: the alternation of 3rd-person singular pronouns 


(1) biiya Bétu, laaca laa, 
bi-uuya — BelHtu laacalH laa=0 
COMPL=see Betu SAME — BASE-3 


“Betu saw himself? 


(2) biiya; laaca  láa, 
bi-uuya-O  laacal!!laa-O 
COMPL=see=3 SAME  BASE=3 
‘S/he saw himself/herself? 


Meanwhile, the overt pronoun can only be bound by another overt pronoun, as 
shown in (3)-(5): 


(3) biiyabe; láacá  láabé; 
bi=uuya=be!H laacal laa-bel 
COMPL=see=3.HUM SAME  BASE=3.HUM 
“S/he saw himself/herself. 


(4) biiya Bétu, (“laaca) laabé, 
bi-uuya Betu (laacalH) laa=belH 
COMPL=see Betu (SAME)  BASE-3.HUM 


‘Betu saw him/her (“himself)? 


(5)  biiyaz (‘laaca) laabé, 
bizuuya=0  (laacalH) laa-belH 
COMPL=see=3 (SAME)  BASE=3.HUM 
‘S/he saw him/her (himself): 


Therefore, the overt form can only co-refer with another overt form and a zero 
form can co-refer with either a full NP or a zero form, but not an overt form, 
within the main clause. A similar situation holds for dependent clauses. 


4.1.2 Dependent clauses 


An overt third-person pronominal subject in a dependent clause cannot co-refer 
to the subject NP in the main clause: 
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ES 


(6) racaladxi Bétuz guéedábé, ixi 
rizaca-ladxi Be“ tu gu’4=eedal4=beM guixi H 
HAB=occur-gut Betu  POT=come=3.HUM tomorrow 


“Betu wants him/her to come tomorrow. (MP 13)! 


The overt form in the dependent clause cannot refer to Betu. Instead, a zero form 
must be used (7): 


ES 


(7) racaladxi Bétu; guéeda, ixi 
ri=aca-ladxi Betu gulB=eedalH=g guixi'H 
HAB=occur-gut Betu PoT-come-3 tomorrow 


“Betu wants to come tomorrow. (MP 22) 


Identical pronominal forms obligatorily co-refer across dependent clauses, as in 
(8), (9): 

(8) racaladxibe, guéedábé, ixi’ 
ri-aca-ladxi-be"! gu H-eedal H-pelH guixi H 
HAB-Occur-gut-3.HUM POT=come=3.HUM tomorrow 
‘S/he wants to come tomorrow. 
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(9) racaladxi, guéedá, ixi 
ri-aca-ladxi-O ^ gu-eedalH-Q guixi'H 
HAB=occur-gut=3 POT-come-3 tomorrow 


‘S/he wants to come tomorrow: 


They may both either be overt or both zero. In contrast, non-identical pronominal 
forms do not co-refer, as shown in (10), (11): 


(10)  racaladxibe; guéedá; ixi 

ri=aca-ladxi=be H gulH=eedalH=øØ guixi'H 

HAB=occur-gut=3.HUM POT=come=3 tomorrow 

‘S/he wants him/her to come tomorrow. (MP 34) 
(11) racaladxi> guéedábé; ixt 


ri-aca-ladxi-O gu=eedalH=bel! guixi'H 
HAB=occur-gut=3 POT-come-3.HUM tomorrow 


‘S/he wants him/her to come tomorrow: (MP 77) 


‘Tf the example is not from my own corpus, I refer to the source of the examples using the fol- 
lowing notation: MP= Marlett & Pickett (1996). The number that follows refers to the example 
number in the source. 
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Similarly, an overt third-person pronominal object in a dependent clause cannot 
co-refer to a previously mentioned NP in the main clause (12): 


(12) na Bétu; Yérmo biiya laabé3 
na BelHtuYelHrmo bi-uuya — laa-bel 
say Betu Yermo | COMPL-see BASE=3.HUM 


‘Betu, said Yermo, saw him...’ (MP 63) 
The zero form must be used for co-reference (13) 


(13) na Bétu; Yérmo biiya laa 
na BelHtu Ye! Hrmo bi-uuya  laa-O 
say Betu Yermo | COMPL-see BASE=3 


‘Betu, said Yermo, saw him.,. y. (MP 63) 
Based on evidence from reflexives and dependent clauses, then, we can say that 
the above generalization is true between a main clause and a dependent clause 


as well. That is, the overt form can only co-refer with another overt form and a 
zero form can co-refer with either a full NP or a zero form, but not an overt form. 


4.1.3 Adverbial clauses 


Similarly, the overt form in a pre-posed adverbial clause cannot refer cataphori- 
cally to an NP in the main clause (14): 


(14) óra  guéedábé, lá, ze Bétuz nisa qué 
olHra gul Hz eegaL H-pelH 14H z.e BelHtu nisa quel 
when POT=come=3.HUM LA FUT.drink Betu water DIST 
"When he, y y comes, Betu, will drink that water’ (MP 10) 


Here, the use of the overt form in the adverbial clause does not co-refer with the 
subject NP of the main clause. Instead, a zero form must be used (15): 


(15) órá  guéedá, lá, ze Bétu; nisa qué 
óra -eedal.H-Q la! z e LHty ni 
gu-eeda^ -2]la z.e Be" tu nisa que 
when Por-come-3 LA FUT.drink Betu water DIST 
"When he, comes, Betu, will drink that water” (MP 10) 


LH 


To be clear, between an adverbial clause and a main clause, the overt form will 
co-refer with another overt form and a zero form will co-refer with either a full 
NP or a zero form. 
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4.2 The overt versus zero alternation in a Pear Story monologue 


Having observed the various syntactic environments conditioning the use and 
co-reference of both the overt and the zero form, the following sections explore 
the choices that speakers make in assigning one or other of these pronouns to 
referents in discourse. 


4.2 The overt versus zero alternation in a Pear Story 
monologue 


In the following excerpt from a re-telling of the Pear Story, the speaker initially 
assigns the overt third person form to the man picking pears, line 04, and the 
zero form to the boy on the bicycle, line 08. However, in line 14, the overt form is 
now used to refer to the bike boy, in the moment he rides past a new participant, 
the bike girl (for clarity, the overt form is marked using [,] and the zero form 


using [5]): 


(169 01 bihuiini lu ni lá, 


LH 1 aH 


bi-huiini lu ni 
COMPL-appear face 3sG.INAN LA 


“There appears; 


02 ti rígola cuchuugu caadxi | cuánanaxhi 
ti riPlgola cuu=chuugu” —caadxil!! cuananaxhi 
one man | PROG.CAUS-Cut few fruit 


“a man cutting some fruit? 
03 rígola que lá, 

ri gola quel laH 

man DEM LA 


"Ihat man; 

04 má bichabe; chüpá dxúmí ní 
ma  bi-cha-be"! chupal! dxumi!! ni 
already COMPL.CAUS=fill=3.HUM two basket REL 
bichuugubé, 


bi=chuugu=bel#! 
COMPL=cut=3.HUM 
‘he had already filled two baskets of pears that he cut. 


05 raque  cúchabe;, guíra  péra 
raquel! c.u-cha-bel guiralH pelHra 
then  PROG.CAUS=put.in=3.HUM all pear 
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cuchugubé; 

cu-chugu-belH 

PROG=cut=3.HUM 

“Then he was putting in all the pears he was cutting’ 
06 dxí'babe; lú yaga qué 

dxiHba=beHĦ lu yaga quel 

climb-3.HuM face tree DIST 


H 


“(He was) up in that tree. 


07 qué fannadibé, bédanda tí 
que! ña-nnal!-di=be H be-dandalH ti 
NEG IRR=know-EMPH=3.HUM COMPL=arrive.there one 
xcuídihuiini 
xcuiHdi-huiini 
boy-DIM 


“He didn't know a boy arrived there? 

08 dxí'ba> ti bicicléta 
dxiHba-O ti  bicicleMta 
PART.climb=3 one bicycle 
“(He was) on a bicycle: 

09 gucaa ti dxumi  péra qué 
gu=caa=D ti dxumilH pera que 
COMPL=put=3 one basket pear DIST 
‘(He) put that basket of pears’ 

10 bidxi’ba, lu xpicicléta, 
bi=dxiba=0 lu x-bicicle"ta-o 
COMPL-climb-3sc face poss=bicycle=3 


LH 


‘(He) got on his bicycle: 
1 ne  bíree, Ze» 
nelH bi=ree=0 ze-Qo 
and COMPL=leave=3 PART.go=3 
“And (he) left? 
12 gula’na xcuídi que  dxümí  péra  stibé, 


gu=la'na  xcuidi quel! dxumilH pera sti H- pel H 
coMPL-stealboy DEM basket pear  Poss-3.HUM 


“That boy stole his basket of pears. 
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13 huaxa neza ze xcuídi que lá, 
huaxa neza z.e xcuildi que! laH 


but path PART.go boy DEM LA 


‘But on the path as the boy was leaving; 

14 málasi bidxagabe, tí badudxaapahuiini 
maHlasi!H bi-dxaga-belH ti  badudxaapa-huiini 
suddenly COMPL=cross=3.HUM one girl-DIM 
‘Suddenly he crossed a little girl’ 

15 dxí'ba; sti bicicléta 
dxiHba-Q sti —bicicle"ta 
PART.climb-3 other bicycle 


“(She was) on another bicycle? (Pear Stories TVA: 4-18)? 


Before line 14, the narrator refers to the bike boy using the zero form. After line 
14, the bike boy is referred to using the overt form. This switch in third person 
form announces or prepares the hearer for the introduction of the girl, who is 
thereafter referred to using the zero form. The bike boy, the most highly thematic 
participant, is referred to using the overt form for most of the remainder of the 
narration up until the very end, when focal attention is again paid to the pear 
man, who is then referred to using the overt form. 

This alternating use of the overt and zero third person forms to refer to dif- 
ferent characters in the Pear Story is consistent across the Pear Story corpus. 
The pear man is consistently assigned the overt form. The bike boy is initially 
assigned the zero form when he is introduced as a participant, is then assigned 
the overt form when the bike girl appears, and is then assigned the zero form 
again when the pear man returns to the scene. The bike girl and the boy with the 
paddleball are consistently referred to using the zero form. The use of the overt 
and zero forms across the Pear Story narratives can be summarized schematically 
this way: 

Again, this pattern is consistent across all of the Pear Story narratives in the 
corpus. The overt form is never used with either the bike girl or the boy with the 
paddleball. Conversely, the zero form is never used with the pear man. The use 
of the overt form coincides with the more thematic participant at each particu- 
lar juncture in the narrative. This is surprising given the strong cross-linguistic 
tendency for highly topical participants to be zero-coded, and for overt coding 
to signal a change of topic or indicate a less topical participant. In the Pear Story 


?See Appendix A. 
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Table 4.1: Third person forms assigned to Pear Story referents 


Overt form Zero form 


Pear man Y 

Bike boy "4 "4 
Bike girl Y 
Boy with paddleball Y 


narratives, therefore, ZAI speakers use the distinction between the overt and zero 
third person forms to assign referents varying degrees of thematicity. In the next 
section, I illustrate a similar use in conversation. 


4.3 The overt versus zero form in conversation 


In a similar way to the use in narratives described above, the overt-zero alterna- 
tion can be used productively in dialogue not only to distinguish between two 
third-person participants but also to mutually construe one as more or less the- 
matic than the other. The following example is taken from a conversation be- 
tween two men, VA and CH. VA is asking CH about his father and goes on to ask 
how long each of CH's parents lived. Note, in particular, the intervention in line 
06 by VA, where a zero third person form is assigned to CH's mother (again, for 
clarity, the overt form is marked using [1] and the zero form using [»]): 


(17) (VA and CH, 27 Sept 2012) 

01VA: panda íza bibani bixhozelu'? 
pandalH iza bi-bani ^ bixhoze-lu' 
how.many year COMPL=live father-2sc 
“How many years did your father live?” 

02 CH: nabanibe, cérca de ochénta 
na-bani-belH ceHrca de ochenta 
STAT-live-3sc.HUM close to eighty 


‘He lived close to eighty’ 


03 VA: xheelabe, ya? 
xheela'-belH ya’ 
spouse-3sG.HUM Q 
‘And his wife?’ 
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04 CH: 


05 CH: 


06 VA: 


07 CH: 


08 VA: 


09 CH: 


10 VA: 


11 CH: 


12 CH: 


4.3 The overt versus zero form in conversation 


xheelabe; lá, 
xheela'-belH laH 
spouse=3SG.HUM LA 


“His wife; 
laaca — gudi’dibe, séténta también 
laaca.H gu-di'di' -belH setenta tambien! 


also — coMPr-pass-3sG.HUM seventy also 


“she also passed seventy. 


ah, laa;  nírú güti; 
ah laa-Q niru gu-ti-o 
INTJ BASE=3 front COMPL-die-3 


“Ah, (she) died first. 
priméru  laabé, 
primelHru laa-belH 


first BASE-3SG.HUM 

“First him? 

ah laabe; má' gutibé, 

ah laa-belÀ ma  gu-ti-belH 


INTJ BASE=3SG.HUM already COMPL=die=3SG.HUM 
“Ah, he already died? 


priméru  laabé; 
prime'Pru laa-belH 


first BASE-3SG.HUM 

“First him? 

ah laabe; jmaca huaniisibe; qué jñaalu' 
ah laa-belH ¡maca huaniisi-bel que! jñaa=lu' 


ya”? 
ya 


INT] BASE=3SG.HUM more  old-3sc.duM M  mother=2sG Q 


‘Ah, he was older than your mother?” 

laabe, jmá huaniisibé, 

laa-be- jma! huaniisi-bel 

BASE=3SG.HUM more old-3sc.HUM 

“He was older: 

udi’dibe, lú binnigdla qué  zuluá 
gu-di di -belH lu binnigo™ a quel z.ului’=a’ 


H 


bia’ 
bia’ 


COMPL=pass=3sG.HUM face oldperson DEM  FUT.seem=1sG like 
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tapa iza 
tapa iza 
four year 
“He passed the old person, I think, by about four years. 
13 CH: peru udi dibe, zuluá' bia’ tapaiza lu 
peru gu-di' di'-bel z.ului'-a'H ^ bia'tapaiza lu 


but coMPL-pass-3sG.HUM FUT.seem=1SG like four year face 
jaa’ 
jñaa=a' 

mother-1sc 

‘But he passed my mother by four years? 
14CH: jmá huaniisibe;  xcáadxi 

jma huaniisi-bel! xcaadxi 

more old=3sG.HUM some 

‘He was a bit older? 
15 VA: ¿dxiiñara fiaa guzaabe, dé 

dxiiña ra faa gu-zaa-bel! de 
work roc field compi-complete=3sc.HUM from 
nahuiinibe, lá? 
na-huiini-belH ]aH 
STAT-small=3sG.HUM Q 


“Did he work in the fields since he was little?” 


In line 5, CH states that his father's wife, i.e. his mother, passed away when 
she was seventy. He refers to her using the overt form. In the next line, line 6, 
VA intervenes to ask whether his mother had passed away before his father, but 
refers to her using the zero form. In line 7, CH corrects VA and responds by saying 
primeru laabe “first him”, using the overt form to make clear that it was his father 
who passed away first, not his mother. In line 8, VA picks up on the use of the 
overt form and uses it again to check that he has understood correctly, saying 
ah laabe ma gutibe “ah, he already died”. In line 9, VA confirms this, repeating 
primeru laabe “first him”, using again the overt form to refer to his father. The 
use of the overt form to refer to the father continues throughout the rest of the 
interaction. 

One of the outcomes of VA's turn in line 6, then, is that the zero form is as- 
signed to refer to CH's mother and the overt form is assigned to refer to his 
father. Rather than using a full NP to disambiguate reference, VA relies on the 
contrast between the two third person forms to create a contrast between the 
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father and mother. It is not a coincidence that the overt form was chosen to refer 
to the father, as he is the more thematic figure and the center of this conversa- 
tional episode. In contrast, the zero form is used for the mother, the less thematic 
figure. 

This contrast between the overt enclitic and the zero form in third person is 
similar to the proximate/obviative contrast in Algonquian languages, in which 
proximate forms are used for the third person most central to the discourse and 
the obviative forms for more peripheral third persons (Dahlstrom 1991; 2003; 
2014). As with the proximate/obviative opposition, it would be interesting in 
future work to explore the extent to which the overt/zero alternation in ZAI can 
be sensitive to other factors such as empathy, agency, and point of view. 


4.4 Summary and conclusions 


This chapter summarized the pragmatic status of the two types of third person 
pronominal forms, the zero and the overt subject enclitic form, and explored 
the distribution and alternation of these forms in narrative and conversation. In 
addition to showing the syntactic facts governing the distribution of the overt 
and zero forms, this section showed that an important factor governing their use 
is the relative thematic salience of the referents, wherein the overt pronoun is 
used for more thematic figures and the zero for less thematic figures. Again, the 
ZAI data is unusual in this regard as one would expect the reverse: highly topical 
participants to be zero-coded and less topical participants to be coded with overt 
forms. 

Chapter 6 takes the analysis made in this chapter as a basis to consider the 
relationship between cognitive status and topichood and the expression of topic 
relations between discourse referents and propositions. As will be seen, while 
cognitive status is not a prerequisite for topichood, topic referents usually have 
a certain degree of pragmatic accessibility such that more acceptable topics are 
higher on a cognitive status scale. First, I turn to an analysis of focus structure 
in ZAI, which is the subject of the next chapter. 


?See, in particular, Dahlstrom (2014) in which the author argues that the definitions of both 
proximate or obviative cannot be reduced to that of topic or focus. 
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In this chapter, I move away from the discussion of the specific forms of ZAI 
nominals and the ways that these signal more or less accessible referents and 
turn towards an analysis of the information structure categories of topic and 
focus. Topic and focus relations involve the relations not between discourse ref- 
erents and accessibility but between discourse referents and propositions. That 
is, in similar sentences uttered in different contexts, the cognitive status of two 
referents may be the same, but the function - i.e. topic or focus - may be differ- 
ent; as such, cognitive status is only a precondition for the expression of these 
functions (Lambrecht 1994). The analysis below focuses on pragmatic phenom- 
ena that have particular correlates in clause or sentence structure. As we will see 
from the analysis that follows, the flexible nature of constituent order in ZAI is 
an important resource for ZAI speakers in organizing information structure. 

This chapter aims to show that ZAI is a verb-initial language that displays flex- 
ible syntax whose linear order is strongly motivated by the pragmatic function 
of the utterance. In particular, linear order is determined in large part by deci- 
sions made by the speaker with respect to what the proposition is about, what 
is contextually dependent, what is pragmatically presupposed, and what is as- 
serted. Chapter 6 explores related phenomena from the perspective of ZAI topic 
relations. 

In this chapter, I investigate the organization of focus structure in ZAI again 
with an emphasis on the ways that the various typological characteristics of the 
language - phonological, morphological, and syntactic - interact with each other. 
The ZAI data supports the hypothesis that ZAI speakers mark focus relations 
primarily through the manipulation of constituent order and/or through mor- 
phological marking (for other Zapotec languages, see Broadwell 1999; Lee 2000) 
rather than through prosodic means. There does not seem to be any evidence for 
any pitch accents directly associated with focal material, although elements may 
display various prosodic properties — duration, pitch register, and pitch range - 
that may be related to the position within a given intonation unit in which they 
appear. 

The chapter begins with a discussion of focus structure in ZAI and an analysis 
ofthe conceptualization of Lambrecht (1994) as it applies it to the ZAI data. In the 
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section that follows, I introduce the typology of focus structure proposed by Van 
Valin (1999) and examine the place of ZAI within that typology. I then present and 
discuss a conversational strategy by ZAI speakers involving the parallel, chiastic 
use of predicate focus and argument focus to accomplish specific conversational 
goals. 


5.1 Focus structure 


The term focus structure (Lambrecht 1994) refers to the grammatical means by 
which a language indicates the scope of the assertion in an utterance and differ- 
entiates it from the presupposed or topical material. 

The main contrast in focus structure is between broad focus and argument fo- 
cus. Whereas in broad focus the focus domain extends over more than one con- 
stituent, in argument focus the focus domain extends only over one constituent. 
In broad focus constructions —which invariably involve verb-initial structures in 
ZAI- the verb is part of the assertion. In narrow focus constructions, the verb 
is part of the presupposition. In ZAI, narrow focus constructions tend strongly 
to not be verb-initial. The relevant generalization is the following: the verb will 
form part of the focus domain unless the construction is an argument focus con- 
struction, in which case it forms part of the presupposition. 

There are two types of broad focus, predicate focus and sentence focus. I ad- 
dress these in turn. 


5.1.1 Predicate focus 


Predicate focus is traditionally referred to as a topic-comment construction, in 
which the subject is the topic and the predicate is a comment on that topic.! 
This is the unmarked focus type. The following examples from Lambrecht (1994) 
illustrate this focus construction type in four different languages: English, Italian, 
French, and Japanese. The sentences represent a prototypical response in each 
respective language to the question "How's your car?" which establishes “my 
car" as the topic (boldface indicates focal stress). 


(1) Q: How's your car? 


a. My car/it broke down. English 
b. (La mia macchina) si è rotta. Italian 
c. (Ma voiture) elle est en panne. French 
d. (Kuruma wa) koshooshita. Japanese 


'Predicate focus is discussed in §6.1.2 in terms of topic-comment constructions. 
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In each case, the predicate is a comment or assertion about the subject-topic 
“my car”. In English and Italian, the subject NP is the topic. In French, it is a 
detached NP, and, in Japanese, it is a wa-marked NP. In each of these languages 
the order of constituents is S-V and there is focal stress on the verb. 

The realization of predicate focus is substantially different in ZAI, where pred- 
icate focus constructions are verb-initial: 


(2 guxhiiñe xcoché’ 
gu-xhiifie’ x=coche=e’ 
COMPL-break.down Poss-car-1sc 


H 


“My car broke down? 


Although the subject-topic may be a full NP, as above, a subject pronominal clitic 
is more common: 
(3) guxhiifieni 
gu-xhiiñe -ni E 
COMPL-break.down-3.INAN 
‘It broke down? 


The predicate thus occupies the clause-initial position in ZAI followed by the 
subject-topic, which can be realized as an enclitic or as a full NP.? 

Below is a second example of a prototypical predicate focus construction in 
ZAL 


(4) Q: What did the boy do? 
bidxaagabe tí dxaapahuiini’ 
bi-dxaaga-bel ti dxaapa-huiini’ 
COMPL-encounter-3.HUM one girl-DIM 


‘He encountered a girl’ 


This is a transitive clause where the subject-topic, “the boy”, appears as an enclitic 
on the verb and the predicate, 'encountered a girl' is the comment or assertion 
about the subject-topic. Again, this is a verb-initial construction. 

The verb and the object are in the focus domain in this case, but neither re- 
ceives focal stress in the form of a pitch accent. There is a gradual downdrift in 
pitch from the beginning of the clause to the end, but no specific pitch accent 


?Predicate focus with a transitive verb and two full NP arguments would require the topical 
subject NP to appear before the verb. However, because topical subjects are very rarely coded 
using full NPs, this word order occurs in my corpus only in elicitation contexts. 
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occurs on either the verb or the object. The one H tone in the clause surfaces on 
ti as a result of the floating tone from the third person enclitic -be. This can be 
observed in the pitch track of this utterance shown in Figure 5.1 below: 


20130114 S MRB bidxaagabe ti dxaapahuiini 


1.65775373 2.99275894 
4504 


300 


qe TN 


Pitch (Hz) 


100+ 


707 


50 


bi- dxaaga =be*| ti dxapahuiini’ 


1.658 2.993 
Time (s) 


Figure 5.1: Pitch track 


In general, elements that appear at the beginning of the intonation unit are 
pronounced with longer duration, a higher pitch register and wider pitch range, 
i.e. properties associated with beginnings and endings of intonation units. In this 
case, it is the verbal constituent that occurs in the prosodically more prominent 
position, the beginning of the intonation unit. The object NP constituent occurs 
in the next most prosodically prominent position, the end of the intonation unit. 

Consider, now, the following example, taken from conversation: 


(5) (M18 March 2012, 08:47.0-08:52.0) 


01 bibané lá, 


H H 


bi-bani=a’ la 


COMPL-wake.up-1sG LA 


‘I woke up; 
02 guzé xa 
gu-zi-a'H xa 


COMPL-shower-1sG INTJ 


‘I showered, 
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03 gúé ti jügo  denaránjasi xá 
H LH LH 


gü-e-a' ti jugo de naralHnja-si! xa 


coMPL-drink-1sc one juice of orange-only INTJ 


‘I drank an orange juice only. 


Here, the speaker remembers and tells about the sequential events during a morn- 
ing routine. Each of the three lines is a predicate focus construction. Each clause 
is verb-initial, with the narrator as the subject-topic and each predicate advanc- 
ing the events in the narrative. 

As seen in Figure 5.2, in this case as well, there is no pitch accent associated 
with any of the constituents of the sentence. 


20120318 C TVA 02 gue ti jugo 


450 
300 : o ! | 
i | ra LL 

F E a 

Z : 

E E 
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A 
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50 
gue!” ti ju*go de nara*nja si* |xa 
2.17 3.891 
Time (s) 


Figure 5.2: Pitch track 


In the last line, line 3, The H and LH tones that surface can be directly attributed 
to the underlying tones. The verb güe carries an H tone from the first person 
enclitic. The NPs jugo and naránja both carry an LH tone on the stressed syllable, 
as is characteristic of many Spanish loanwords. Finally, the particle -si attached 
to the object NP contains a floating H tone that surfaces on the final particle xa. 

The principal characteristic of predicate focus constructions in ZAI, therefore, 
is that they involve a verb-initial main clause. Again, the verb is part of the focus 
domain and does not receive focal stress in the form of a pitch accent. Addition- 
ally, there is a gradual downdrift in pitch from the beginning of the clause to the 
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end, but no specific pitch accent occurs on the object either. Below, we will com- 
pare predicate focus constructions to argument focus constructions in which a 
different constituent may occupy the pre-verbal position. First, I discuss sentence 
focus constructions, which are also verb-initial. 


5.1.2 Sentence focus 


I turn now to sentence focus.’ In these, there is no topical subject and the focus 
domain is the entire sentence (again, examples are from Lambrecht 1994). 


(6) Q: What happened? 


a. My car broke down. English 

b. Misié rotta la macchina. Italian 
Lit. “Broke down to me the car’ 

c. Jai ma voiture qui est en panne. French 
Lit. T have my car which broke down' 

d. Kuruma ga koshooshita. Japanese 


Unlike the examples of predicate focus listed in (1), each of the sentences in (6) 
lacks a presupposed topic and, instead, the entire sentence is asserted. English 
uses the same syntactic construction as in (1); however, in this case the subject NP 
receives focal stress. In Italian, the focal stress still falls on the final constituent of 
the sentence, but the syntactic construction is altered so that the focused subject 
NP appears sentence-finally. In French, both the focal stress and the syntactic 
construction differ from (1) and a part of the information is now communicated 
via a relative clause. In Japanese, both the subject and the verb receive focal stress 
and the subject is marked using the morpheme ga rather than wa. 

In ZAI, the construction is formally identical to the predicate focus construc- 
tion in (2), except in this case there is no option to represent the subject as an 
enclitic. It must appear as a lexical NP: 


(7) guxhiiüe xcoché' 
gu-xhiifie' x=coche=e' 
COMPL-break.down Poss-car-1sc 


H 


'My car broke down: 


3Sentence focus is discussed again in $6.11 in terms of presentational or event-reporting 
constructions. 
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As we will see in the discussion of event-reporting constructions in $6.1.1, the 
most common use of sentence focus constructions is presentational construc- 
tions, to introduce new participants to a discourse. Consider the following exam- 
ple taken from a Pear Story narrative: 


(8) bihuinni ti rigola 
bi-huinni ti ri gola 
COMPL-appear one man 


'A man appeared: 


In a typical use such as this, the narrator uses a sentence focus construction to 
introduce a participant into the discourse. As with predicate focus, this is also a 
verb-initial construction which places the verb in the most prominent prosodic 
position. The intransitive subject is introduced as an indefinite noun and occupies 
the position at the end of the intonation unit. There is no topical subject and the 
focus domain is the entire sentence. Here, again, there is no special pitch accent 
associated with this construction. 


5.1.3 Argument focus 


While predicate focus and sentence focus are both types of broad focus, argument 
focus involves narrow focus. In argument focus, the focus domain is a single 
constituent, which may be an object, subject, adjunct, or even a verb (examples 
are from Lambrecht 1994).* 


(9) Q:Iheard your motorcycle broke down. 


. My car broke down. English 
a”. It’s my car that broke down. 
b.  Sié rotta la mia macchina. Italian 


Lit. ‘Broke down my car’ 

b. Ela mia macchina che si é rotta. 
Lit. ‘It’s my car that broke down’ 

c. Cest ma voiture qui est en panne. French 
Lit. ‘It’s my car that broke down’ 

d. Kuruma ga koshooshita. Japanese 


^ Argument focus is discussed in $6.1.3 in terms of identificational constructions. 
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In these sentences, the focus domain is restricted to the NP car. The presupposi- 
tion is that 'something broke down' and the assertion is that it was the speaker's 
car and not something else that broke down. English again uses the same syntac- 
tic S-V-O construction and, as in (6), the subject NP again receives focal stress. 
In Italian, the syntactic construction is altered in such a way that the focal stress 
again falls on the final constituent ofthe sentence. In French, both the focal stress 
and the syntactic construction again differ from (1) and (6), with a part of the 
information again being communicated via a relative clause. In Japanese, the 
subject is marked using the morpheme ga (as in (6d)), and only the subject NP 
receives focal stress. 

In argument focus it is possible for the focused NP to occur post-verbally in 
ZAI, but this is much less common and the preferred order is the following, where 
the focused NP constituent appears pre-verbally in clause-initial position: 


(10) xcoché guxhiifie’ 
x-coche-e'H gu-xhiifie’ 
POSS-car-1sG COMPL-break.down 
“My CAR broke down. 


Below is an example taken from conversation: 


(11) (T and M, 18 March 2012, 16:03.0-16:06.0) 


01 T: ¿tu lá bini ganar, este, primér lugar? 
tul H jal p-ini ganar" este primer" lugar 
who name COMPL-do win INTJ first place 
"Who won, um, first place?' 

02M: ti militár bini ganár dxiqué 
ti militar bi-ini ganar! dxiquel 
one soldier COMPL-do win then 
“A SOLDIER won then. 


Here, the question in line 1 by speaker V introduces the presupposition ’x won 
first place’. Speaker M responds in line 2 with the assertion ‘x is a soldier’ and 
uses a construction in which the subject appears in pre-verbal position followed 
by the verb which forms part of the presupposition. The most prominent prosodic 
position is occupied in this case by the subject NP. 

Consider the following example, also of an argument focus construction. Here, 
the speaker's own statement in line 1 sets up a presupposition which is followed 
in line 2 by an argument focus construction. 
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(12) (M, 18 March 2012, 10:20.5-10:23.5) 


01 nin quí ñahuadiá de endaré gasti’ 
nin qui f-ahua-di-a'H de guendaro-a'H gasti'H 
not.even NEG IRR-eat/drink-NEc-1sc of food-1sc nothing 
‘I didn't even eat/drink any of my food? 

02 jügo  quesí gué 
jugo quel H-silH gu-e-4'H 
juice DEM-only compt-eat/drink=1sc 
'I drank ONLY THE JUICE? 


Note first that the verb ‘to eat/drink’ is the same verb in line 1 as in line 2, the 
phonological form of the verb is conditioned by the TAM prefix. In line 1, the 
speaker sets up the presupposition “I ate/drank x’. He continues in line 2 with 
the assertion °x is only the juice. 

It is not the verb but an NP constituent that is in the prosodically prominent 
position at the beginning of the intonation unit. As above, however, there is no 
particular pitch accent associated with any particular part of the utterance (Fig- 
ure 5.3). 
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Figure 5.3: Pitch track 
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We can compare this construction to the predicate focus construction, 'gue ti 
jugo de naranjasi xa’ in (5) uttered by the same speaker. The constructions carry 
almost identical propositional content, except that in (5) the speaker uses an in- 
definite object NP and in (12) uses a definite object NP. The two utterances differ 
also in the order of constituents, with the object NP occurring pre-verbally in 
the argument focus construction (5) and post-verbally in the predicate focus con- 
struction (12). I return to pairs of utterances such as these in $5.2, where I discuss 
the patterned use of predicate focus followed by argument focus in conversation 
and explore the combined discourse function of the two constructions. 

First, it should be noted, however, that argument focus constructions do not 
have to be NF-initial. A construction such as the following, with a verb-initial 
structure, would also be acceptable in the same situation: 


(13) gué jügo quesi 
gu-e-a'H julHgo quel H-siLH 
coMPL-eat/drink-1sc juice  DEM-only 
'I drank ONLY THE JUICE? 


There is no formal marking that separates this construction from a predicate fo- 
cus construction, leaving it formally ambiguous. However, an NP in pre-verbal 
position unambiguously signals the focal nature of the NP. In verb-initial con- 
structions, focus may fall on the verb. Only contextual information allows the 
participants to understand that the presupposition and assertion in the verb- 
initial version remain the same as in the original construction of line 2 in (12). 
Still, while a verb-initial structure can alternatively be used to communicate ar- 
gument focus, the use of a pre-verbal constituent will always signal argument 
focus, unless the pre-verbal element is a subject NP and a resumptive pronomi- 
nal clitic appears on the verb, as in the case of topicalization (see 86.1.4). 

In the following section, I turn to a related argument focus construction in- 
volving the use of the particle NGA. 


5.1.4 The use of NGA in argument focus 


The particle NGA carries an H tone and is used in two types of constructions. One 
is in copulative constructions, such as in (14), where NGA, according to Pickett 
et al. (1998: 94), "emphasizes" the subject: 
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(14) laabe nga máistru 
laa=be™! nga! mai? stru 
base=3sG NGA teacher 
“HE is a teacher: (Pickett et al. 1998: 94) 


In this example, the independent pronoun functions as the subject of the clause, 
followed by NGA, and then maistru “teacher”. This construction contrasts with the 
alternative copulative construction involving a zero-copula: 


(15) máistru laabé 
maistru laa-belH 
teacher base-3sc 


“He is a teacher. 


These two constructions differ in that while (14) is a type of argument focus 
construction, (15) is an example of predicate focus. 

The NGA particle may be used in other constructions as well. It may be used 
to “emphasize” a subject of a transitive clause, as in (16): 
(16) naangá bi'né ni 

naa nga! bi-i’ni=a’_ ^ nilH 

ISG NGA COMPL-d=1sG 3INAN 
“Tam the one who did it? (Pickett et al. 1998: 98) 


In these cases, a co-referring dependent pronoun appears as an enclitic on the 
verb. In addition, it may be used to “emphasize” a direct object, as in (17). 


(17) Juan nga biiyalu neegue’ 
Juan! nga" bi-uuya-lu' ^ neegue' 
Juan NGA COMPL-see=2sG yesterday 


‘It was Juan who you saw yesterday? (Pickett et al. 1998: 98) 


The function of the NGA particle to provide “emphasis”, as described by Pickett 
et al. (1998), can be understood in terms of Lambrecht (1994) as narrow or argu- 
ment focus. Yet, it differs from argument focus constructions in which NGA is not 
present. Example (17) is not identical to (18), the corresponding argument focus 
construction without the particle NGA: 


(18) Juán biiyalu neegue’ 
Juan" bi-uuya-lu' neegue' 
Juan COMPL-see=2sc yesterday 


“You saw JUAN yesterday? 
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The sentence in (17) requires an exhaustive listing interpretation where it was 
Juan and only Juan who the hearer saw yesterday. Meanwhile, the corresponding 
sentence without NGA in (18) requires only an information focus interpretation 
in which the hearer saw Juan yesterday but may have seen others as well. 

Anexample from a Pear Story narrative illustrates the use of NGA further. Here, 
NGA appears in the third line after the phrase suerte stibe “his luck’. 


(19) 01 ne  biába también dxumi qué 
ne! bi-aba tambien dxumiH quel 
and compPt-fall also basket DIST 
“And the basket fell also? 

02 ne  láabé támbién 


nel! laa=beH tambientH 
and BASE=3sG also 


“And he (fell) also. 
03 suérte  stibé nga gaxhanuu cádxixcuídi casi 
suerte stilH=belH nga! gaxha n-uulH. cadxi xcuildi casi 
luck POSS-3sG NGA close sTAT-be some child almost 
laabé 
laa-belH 
BASE-3SG 
‘It was lucky for him there were some kids close to him (Pear 


Stories, V: 1.15-17) 


The narrator is describing an event in the Pear Story in which the boy as well 
as the basket of pears he is carrying fall from the bike. The narrator uses a con- 
struction involving the particle NGA in the third line to accomplish two impor- 
tant discursive goals. First, the narrator introduces a new participant into the 
discourse, a group of three boys walking by (who would eventually help him). 
Second, the narrator points out that, contrary to the listener's expectations, the 
boy was fortunate to have fallen where he did right as the boys were there. The 
use of NGA after the first constituent, suerte stibe, not only marks the end of the 
assertion that the boy was lucky, it also separates this constituent from the rest 
of the utterance which introduces the boys. 

Finally, in this last example, taken from a conversation between J and T, T re- 
sponds to a question by J about whey and explains that one of the uses of the 
whey is as feed for pigs. T concludes his turn with an argument focus construc- 
tion using NGA in line 5: 
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(20) (T 26 May 2012 (05:15.0-05:20.0)) 
01]: ¿xi rünicabe né  suéru? 
xil! runicabelH nel! sue Hru 


what HAB-do=PL-3.HUM with whey 


‘What do they (people) do with whey?’ 
02T: laani lá, 
laani! laH 


BASE=3.INAN LA 
“As for it (the whey); 

03T: nabé rusirooni bíhui 
nabe ru-si-roo=ni H bihui 
very HAB-CAUS-big=3.INAN pig 
‘It really makes the pigs grow. 

04 T: ngue  rúni 
nguelH ru-ni 
DEM | HAB-do 
"Ihat's why; 

05 T: stale binni ngá riquiiñení 
stale binni! ngaH ri-quiiñe=ni 
much person NGA HAB-use=3.INAN 
‘MANY PEOPLE use it. 


LH 


In this example, J asks T a question in line 1. T begins his response in line 2 
using a LA-marked phrase to establish the whey as the topic referent for the 
next clause. In lines 3-5, T explains that, because feeding pigs whey causes them 
to grow, many people use it. His use of the particle NGA in the last line marks 
the statement as an argument focus construction with the subject NP stale binni 
'many people' as the focused constituent. Because it is a focused constituent, 
there is no resumptive subject enclitic on the verb. 

It is interesting to note that in this example it is the object NP, the whey, that 
appears as an enclitic on the verb, not the subject. We would expect the pronom- 
inal object to appear as an independent form, not a dependent form, yielding the 
following utterance with the same propositional content: stale binni nga riquiiñe 
laani. The use ofthe third person enclitic forms for inanimate objects, as in line 5, 
is actually not an uncommon use and one that requires more attention in future 
work. I have heard it myself on many occasions in informal settings, but have not 
yet encountered it in my corpus, so I have little to say about it at this point. One 
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hypothesis is that it is perhaps the role of the object NP as object-topic in this 
construction that allows it to appear as such and that this is a change in progress. 

In summary, in this chapter we have observed the following pattern in the 
information structure of ZAI: while sentence focus and predicate focus construc- 
tions are consistently verb-initial, argument focus constructions contain either 
pre-verbal constituents (within the clause) or may be verb-initial. That is, con- 
stituent order in ZAI adapts to discourse functions. Pre-verbal elements are ex- 
clusively part of the focus domain, whether argument focus or sentence focus. 

There is no evidence for any pitch accents directly associated with either topi- 
calor focal material, although elements may display various prosodic properties- 
longer duration, higher pitch register, and greater pitch range- that may be re- 
lated to the position within a given intonation unit in which they appear. Focused 
elements (either nominal or verbal constituents) tend to occur in prosodically 
more prominent positions, i.e. beginnings of intonation units. The elements that 
appear at the beginning of intonation units are pronounced with longer dura- 
tion, a higher pitch register and wider pitch range, i.e. properties associated with 
beginnings of intonation units. 

From this perspective, given the range of functions available in the verb-initial 
position, ZAI appears to classify as relatively rigid pragmatically since the do- 
main of focus appears to be confined to the pre-verbal position, but as syntacti- 
cally relatively flexible since the verb-subject-object order is not always strictly 
adhered to. I turn to this discussion in the next section. 


5.1.5 Van Valin's (1999) typology of focus structure 


It is clear from the preceding discussion that languages can differ greatly in fo- 
cus structures and in the linguistic resources they have for carrying out various 
discourse functions. One of the dimensions in which languages can differ is the 
syntactic dimension, whereby languages can be more or less rigid in terms of the 
syntactic arrangement of constituents. As the examples above show, a language 
such as English, for example, appears to have a more rigid syntax than languages 
such as French or Italian. Another dimension is that of the focal domain, includ- 
ing the placement of focal stress, whereby languages can be more or less rigid in 
terms of where the focal domain may lie within a given clause. This observation 
is the basis for a typology of focus structure proposed by Van Valin (1999), which 
I review here. 

Lambrecht (1994) conceptualizes focus structure and focus types across lan- 
guages using the notions predicate focus, sentence focus, and argument focus 
that were reviewed and discussed in the previous section. Based on Lambrecht's 
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conceptualization, Van Valin (1999) proposes a way of comparing and classify- 
ing languages in terms of the relative degree of rigidity or flexibility in their 
constituent order and the relative degree of rigidity or flexibility in their focus 
structure. The distinction between rigid and flexible constituent order was dis- 
cussed above in 82.3. While English is a language that fairly rigidly conforms to 
an S-V-O order, we have seen that the constituents of a ZAI clause are relatively 
flexible. 

Central to his analysis of focus structure as relatively rigid or flexible is Van 
Valin's use of the notion "potential focus domain” Van Valin (1999: 513) defines 
"potential focus domain" as "the part of the sentence in which a focal element 
may potentially be found” In English, for example, the potential focus domain 
is the entire main clause, meaning that focal stress can potentially fall on any 
constituent within the main clause, such as the predicate or the right edge of a 
clause (see (1a)), or on a pre-verbal subject (see (6a), (9a)). English is an example 
of a language with relatively flexible potential focus domain. 

The classification of languages in the two dimensions of rigid or flexible, on the 
one hand, and syntax and focus structure, on the other, yields a framework from 
which to view language diversity, for which Van Valin offers the following two- 
by-two typology: This way of classifying languages is based on whether the order 


Table 5.1: A typology of focus structure (Van Valin 1999) 


Rigid focus structure Flexible focus structure 


Rigid syntax French English 
Flexible syntax Italian Russian 


of constituents in main clauses is primarily dependent on syntactic principles (e.g. 
grammatical relations) or on pragmatic ones (e.g. the (assumed) cognitive status 
of referents involved). On the one hand, constituent order may be constrained 
by pragmatic principles. For instance, a language may forbid the assignment of 
focus to pre-verbal subjects, as in Italian, or reserve a specific syntactic position 
for particularly “newsworthy” information, as in Cayuga (Mithun 1992). That is, 
the domain of focus assignment may be more or less fixed (typically with respect 
to the verb). On the other hand, in those languages where constituent order is 
more tightly constrained by syntactic principles, such as English, the encoding 
of information structure is frequently carried out exclusively by prosodic means, 
leaving constituent order intact. 
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Given that the distinction between rigid and flexible is meant to be understood 
as a continuum rather than as a binary distinction, based on the data reviewed so 
far, we can determine where the potential focus domain of ZAI falls on the con- 
tinuum from rigidity to flexibility and, more generally, where ZAI focus structure 
may be located within Van Valin's typology. 

In terms of focus structure, the potential focus domain in ZAI is relatively 
flexible, given that focused constituents can appear either pre-verbally or post- 
verbally. While in broad focus constructions (i.e. sentence or predicate focus), 
the focus domain is post-verbal, in narrow focus constructions there is a strong 
preference for focused constituents to appear pre-verbally, though post-verbal 
focused constituents are possible. Lexical NPs, whether pre- or post-verbal, are 
usually part of the focus domain, as are pre-verbal independent pronouns. Pre- 
verbal lexical NPs may be either focused NPs or topicalized NPs. In contrast, 
pronominal enclitics are always topical. 

In terms of syntax, ZAI is also relatively flexible as arguments as well as non- 
arguments may occur pre- or post-verbally, oftentimes dictated by the needs of 
focus structure. It appears, therefore, that focus structure is more rigid than syn- 
tax, since focus structure may motivate certain syntactic arrangements while the 
reverse rarely, if ever, holds. That is, syntactic structure does not appear to moti- 
vate changes in the focus domain. In this way, ZAI may tend more towards the 
Italian-type rather than the Russian-type. This can be represented schematically 
as follows: 


Table 5.2: ZAI in Van Valin's (1999) typology of focus structure 


Rigid focus structure => Flexible focus structure 
Rigid syntax French = 7 = English 
( ( ( 
Flexible syntax Italian => ZAI e Russian 


Although focus marking in ZAI does not involve pitch accent, focused mate- 
rial may appear only at the beginning or end of an intonation unit, i.e. positions 
of prosodic prominence. One possible motivation, therefore, for the range of con- 
stituent orders observed in the various ZAI construction types, as well as the dis- 
tinction between broad and narrow focus types, may indeed be prosodic. In verb- 
initial structures, where the verb appears in the prosodically most prominent po- 
sition, the verb strongly tends to form part of the assertion. In non-verb-initial 
structures, where non-verbal elements occupy the prosodically most prominent 
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position, the verb forms part of the presupposition. In other words, if the verb 
is the initial element in the clause, it forms part of the focus domain. Otherwise, 
as in typical cases of argument focus, a non-verbal constituent in the pre-verbal 
clause-initial and prosodically most prominent position signals its focal nature.? 


5.2 Focus structures in discourse: predicate focus plus 
argument focus 


Above, I have reviewed the various types of focus constructions available to ZAI 
speakers. We have seen a number of ways in which speakers exploit various 
combinations of nominal forms and constituent orders to achieve their discur- 
sive goals with respect to the communication of topic and focus relations within 
a clause or sentence. In the final section of this chapter, 1 wish to expand this 
perspective by analyzing three related examples in which the specific combina- 
tion of predicate focus followed by argument focus is employed in spontaneous 
discourse for specific ends. We will see that as well as expressing topic and focus 
relations, the combined use of these construction types aids speakers in accom- 
plishing specific, additional interactional goals. 

In the following example, the speaker is recounting what he ate the night be- 
fore an important event in his life. He explains how he was hungry that night 
and ate as he normally would: 


(21) (M, 18 March 2012, 8:31.0-8:37.0) 

01 má candaaná gueela’ 
ma'H  ca-ndaana-a'H gueela' 
already PRoc-be.hungry-1sc night 
‘I started to be hungry at night. 

02 udahuá normál 
gu-dahua HM norma] 
COMPL-eat.1sc normal 


Tate normal (as I normally would). 


As will be seen in 86.1.4, subject NPs in topicalization constructions also appear in the initial, 
most prominent position in the clause. Similarly, in $6.2 we will see that La-marked phrases, 
with their topic announcing or topic promotion function, are set off in a separate intonation 
unit altogether, among other things offering the phrase prosodic prominence. 
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03 normál udahuá' 
normaH] gu-dahua'H 
normal coMPr-eat.1sc 
‘I ate NORMAL (as I normally would): 


The speaker mentions he was hungry that night in line 1 and follows this in line 
2 with a topic-comment or predicate focus construction in which he states that 
he ate as he normally would, udahua normal. Interestingly, he follows this in line 
3 with an argument focus construction, normal udahua, the mirror image of the 
utterance in line 2. In terms of a pragmatic assertion, however, there is little that 
line 3 adds to the hearer's understanding of the event. The information that the 
speaker ate as he normally would that night has already been transmitted. 

There is no additional pitch accent associated with any part of either utterance, 
as we can observe in the pitch track shown below. We can also see, however, that 
there is no substantial pause between line 2 and line 3. In fact, line 3 is begun at 
the pitch level that line 2 ends with (Figure 5.4). 


20120318 C TVA 02 udahua normal 


450 


Pitch (Hz) 


100- 


704 
50 
udahua!’ norma!l norma!l udahua! 
1.155 2.816 
Time (s) 


Figure 5.4: Pitch track 
The use of the predicate focus construction followed immediately by argument 


focus may be conceptualized as a discursive structure of its own which exploits 
the "parallelism" (Jakobson 1966; Fox 1977) of the mirror image syntactic struc- 


104 


5.2 Focus structures in discourse: predicate focus plus argument focus 


tures employed. One of the functions of this parallelism, or “chiastic structure” 
(Silverstein 1984), is to help the speaker extend his speaking turn for an additional 
intonation unit. At the same time, the predicate focus plus argument focus combi- 
nation together mark the end of the speaker's turn. The speaker cedes the floor, 
though not before providing a captivating end to the re-telling of a seemingly 
routine and uneventful night of eating. More importantly, the use of the chiastic 
structure binds the two intonation units into a couplet to be interpreted together. 

This combined use of predicate focus plus argument focus as a chiastic struc- 
ture is employed often in conversation between ZAI speakers. Below is a second 
example. Here, the speaker is talking about his participation in an international 
marathon in Mexico City 25 years prior and uses the chiastic structure of pred- 
icate focus plus argument focus in lines 2-3 to highlight his young age at the 
time: 


(22) (T and M, 19 March 2012, 0:58.0-1:04.0) 
01T: dxi  bixoofié jaa maratón internacionál qué lá, 
dxi bi-xooñe=a! jaa marató!n internacional" quel! laH 
when CoMPL-run=1sG INTJ marathon international DEM LA 
‘When I ran the international marathon? 
02T: má napá veintidós iza 
ma'H  n-apa-a'H 
already HAB-have=1sc twenty-two year 


veintidos! iza 


‘I was twenty-two years old. 


03 T: veintidós iza napá dxiqué 
veintidos! iza n-apa-a'H dxiquelH 
twenty-two year HAB-have-1sc then 
‘I was TWENTY-TWO then? 


After beginning his turn with a La-marked adverbial phrase in line 1 which 
introduces the event of the international marathon as topical, the speaker uses 
a predicate focus construction in line 2 to remark on his age at the time. In line 
3, the speaker repeats the semantically equivalent utterance, this time using an 
argument focus construction in which his age appears pre-verbally. 

In the final example, also from conversation, a similar use of the parallel, chi- 
astic structure is used. This time the particle NGA can be observed. In the first 
two lines, T asks C what kinds of crops his father used to grow on his plot of 
land and whether he had cattle. C responds in lines 3-8. 


ĉI thank Richard Rhodes for useful comments on this point. 
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(23) (Tand C, 27 Sept 2012, 1:33.5-1:49.0) 
01 T: ¿xi bídxí babe ya? 
xil! bi-dxi'Hba-belH ya’ 
what COMPL-grow-3.HUM Q 


"What did he grow?' 

02 T: ¿gupabe yüzé lá? 
gu-apa-belH yulHzelH JaH 
COMPL-have=3.HUM cattle Q 
‘Did he have cattle?’ 

03 C: bidxí'babe püru  xubá 


bi-dxi'Hba-belH puru xupa'H 
COMPL-grow=3.HUM only | maize 
‘He only grew maize’ 

04C: purtí cheri lá, 
purtiH cheritH laH 
because here LA 
“Because around here; 

05C: püru nga nga rudxíbacabé 
pulHru ngal! nga! ru-dxi'Hba-ca-be 
only DEM NGA HAB-grow=PL=3.HUM 
“Only that is what they grow? 

06C: má püru  xubá' 
ma puru xuba'H 
already only maize 
“Now just maize. 

07 C: ira íxé  cámpesínu nuu  ládü rí lá, 
guira LH ixelH campesilnu n-uul! JalHqu rH. laH 
all all peasant STAT-be side DEM LA 
"All the peasants here (lit. ‘that are on this side’); 

08C: má púru  xubá  rudxí'bacabé 
ma'H pu ru xuba'H ru-dxi'Hba-ca-be — LH 

now just maize HAB=grow=PL=3.HUM 


“Now they grow only maize: 


In response to T's question in lines 1-2, C responds with a predicate focus 
construction in line 3, saying that his father only cultivated maize. In lines 4- 
5, he continues this thought stating that in that region maize is the only crop 
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that was grown and does so using an argument focus construction involving the 
particle NGA. He repeats this thought again in line 6 in a verb-less clause. He ends 
his turn in lines 7-8 with an argument focus construction that is a mirror image 
of line 3. 

Again, the use of the predicate focus construction followed immediately by 
argument focus can be conceptualized as a chiastic structure that exploits the 
parallelism of the mirror image syntactic structures employed. In using this par- 
allel, chiastic structure, the two intonation units are bound into a couplet to be 
interpreted together, and the speaker extends his speaking turn for an additional 
intonation unit, with the second part, the argument focus construction, marking 
the end of the speaker's turn, thereby ceding the floor. 


5.3 Summary and conclusions 


In summary, this chapter explored the range of types of focus constructions in 
the ZAI data. As we saw, in the information structure of ZAI, sentence focus and 
predicate focus constructions are consistently verb-initial and argument focus 
constructions contain either pre-verbal constituents (within the clause) or, alter- 
natively, may be verb-initial. A summary of these facts is shown in Table 5.3: 


Table 5.3: Focus constructions in ZAI 


Context Example Focus type Constituent order 
How's your car? guxhiineni Predicate focus  V-initial 
What happened? guxhiine xcoché” Sentence focus  V-initial 


I heard your motor- xcoché guxhiifie’ Argument focus  pre-verbal NP 
cycle broke down 


In addition, this chapter showed that there is no evidence for pitch accents di- 
rectly associated with focal material. However, elements may display various 
prosodic properties— longer duration, higher pitch register, and greater pitch 
range- related to their position within a given intonation unit. In particular, fo- 
cused elements, be they nominal or verbal constituents, tend to occur in prosod- 
ically more prominent positions, i.e. beginnings of intonation units. Pre-verbal 
elements, for their part, are exclusively part ofthe focus domain. This was viewed 
as a possible prosodic motivation for the focus domain being associated primar- 
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ily with the initial position, be it the verb in a verb-initial construction or a pre- 
verbal element. 

These observations led us to examine the place of ZAI within the typology of 
focus structure proposed by Van Valin (1999). First, because arguments as well 
as non-arguments may occur pre- or post-verbally, we described ZAI as syntacti- 
cally relatively flexible. Second, given that focused constituents can appear either 
pre-verbally or post-verbally, it was determined that the potential focus domain 
in ZAI is also relatively flexible. In broad focus constructions (i.e. sentence or 
predicate focus), the focus domain is post-verbal and, in narrow focus construc- 
tions, there is a strong preference for focused constituents to appear pre-verbally 
(though post-verbal focused constituents are possible). Lexical NPs, whether pre- 
or post-verbal, are usually part of the focus domain, as are pre-verbal indepen- 
dent pronouns.’ In contrast, pronominal enclitics are always topical. 

However, it does appear that focus structure is more rigid than syntax, since 
focus structure can motivate certain syntactic arrangements while the reverse 
never holds. That is, syntactic structure does not appear to motivate changes in 
the focus domain. Therefore, ZAI may tend more towards the Italian-type rather 
than the Russian-type (cf. Table 5.2). 

Finally, the chapter concluded with a discussion of a conversational strategy 
used by ZAI speakers involving the successive use of predicate focus and ar- 
gument focus to accomplish specific conversational goals. The use of the predi- 
cate focus construction followed immediately by argument focus was analyzed 
as a chiastic structure that exploits the parallelism of the mirror image syntactic 
structures employed. In using this chiastic structure, the two intonation units 
are bound into a couplet to be interpreted together, and the speaker extends his 
speaking turn for an additional intonation unit, with the second part, the argu- 
ment focus construction, marking the end of the speaker's turn, ceding the floor. 


7Pre-verbal lexical NPs may also represent topicalized NPs (cf. §6.1.4). 
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The chapter discusses the linguistic resources available to ZAI speakers for ex- 
pressing topic relations. This discussion of topic relations will set the stage for 
the analysis of a very commonly used topic-marking strategy involving the dis- 
course particle LA. 

In this discussion, 1 follow Lambrecht (1994) and use the term topic or topic 
referent to describe the referent or entity which the proposition is about. As 
such, the topic or topic referent is the referent or entity which bears a topic 
relation to the proposition. It is not to be confused with "old" information, which 
refers to the cognitive status of a referent. From this perspective, information 
which performs the role of topic in a given proposition may have a cognitive 
status that is either “old” or “new”. On the givenness hierarchy discussed in 83.2, 
topic referents must be identifiable in the mind of the speaker and hearer, and 
continuous topics are usually also activated and familiar, but this is not a pre- 
requisite for topic-hood. Instead, it is the relation that the topic referent or entity 
bears to the rest ofthe proposition that is significant. By contrast, the terms topic 
constituent or topic NP refer to the corresponding linguistic expression and not 
the referent or entity to which that expression refers. 

Again, as was mentioned in the previous chapter, it is important to bear in 
mind that stress and pauses play a critical structural function in ZAI prosody 
(see 82.2). Pitch accents, however, do not play a role in the marking of topic or 
focus relations in ZAL! 


6.1 Topic constructions 


In Chapter 3 we saw that the cognitive status of discourse referents has observ- 
able and direct correlates in ZAI grammar in terms of nominal forms and the 
grammatical roles — A, S, or O - in which they tend to occur. The cognitive sta- 
tus of referents correlates highly with the pragmatic acceptability of sentences 


'We may keep in mind, as Crocco (2009: 15) states, that “the actual realization of the prosodic 
marking of topicality may vary according to the different positions occupied by the topic with 
respect to the prosodic nucleus of the utterance” 
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in other ways as well. For example, because insufficiently accessible topic ref- 
erents are more difficult for hearers to interpret, topic referents tend to have a 
certain degree of pragmatic accessibility. Lambrecht (1994: 165) expresses this 
correlation in terms of a “Topic Acceptability Scale" by which more acceptable 
topics are coded by linguistic expressions that are higher on a cognitive status 
scale, such as the Givenness Hierarchy in Table 3.24, and less acceptable topics 
are coded by expressions which are lower on this scale. For ZAI, therefore, we 
would predict that the most acceptable topics would be coded by subject clitics, 
while the least acceptable topics would be coded by indefinite NPs or bare nouns. 
In addition, we will see that there is also a correlation between the information 
structure of certain types of constructions and the cognitive status of the topic 
referents involved. In particular, in focus or activated referents do not occur in 
presentational or event-reporting constructions, and type-identifiable referents 
do not occur in "marked topic" or detachment constructions involving the parti- 
cle LA. In other words, NPs in presentational constructions are never pronominal 
forms and NPs in detached, La-marked constructions are never indefinite. 


6.1.1 Presentational constructions 


Cross-linguistically, statements about the weather tend to be thetic construc- 
tions.” An example is presented in (1): 


(1) cayaba  nisaguie 
ca-yaba nisa-guie 
PROG-fall water-stone 
‘Rain falls? 


The construction is verb-initial and the lexical, subject NP is a bare noun. The 
subject is not topical and the focus domain is the entire sentence. 
The following example from a Pear Story narrative shows an event-reporting 
construction with a presentational function: 
(2 rihuinni tí rigola 
ri-huinnilH ti rigola 
HAB-appear one man 


€ > 
A man appears. 


“Constructions such as these are also labeled "sentence focus”; see §5.1.2. They are sometimes 
also referred to as “out-of-the-blue” sentences. 
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The construction, used to introduce a new participant into a discourse, is also 
verb-initial and here the subject is a lexical, indefinite NP. Again, there is no 
topical subject, the focus domain is the entire sentence, and it lacks a presupposed 
topic. In other words, it is thetic, i.e. the whole sentence is asserted. 

In the Pear Story corpus, new referents are always introduced as lexical NPs, 
most often in the O role, followed by the S role, and much more rarely in the A 
role (see Table 3.8). When we take into account animacy, however, new referents 
are introduced at a higher rate in the S role than the O role (see Table 3.9). That 
is, the majority of human referents in the Pear Story corpus are introduced using 
presentational constructions of the type in (2). New referents introduced in the 
O role are introduced using topic-comment sentences, which I discuss in 86.1.2. 


6.1.2 Topic-comment 


In the following example from a Pear Story narrative, the subject in line 2 is the 
topic, and the predicate is a comment or assertion about the subject-topic. 


(3) (Pear Stories, M: 1.4) 
01 má bihuinni tí senor 
ma'H  bi-huinnilH ti seño! Hr 

already COMPL-appear one man 
'A man appeared: 

02 cuchuugube péra 
cu-chuugu'-belH pera 
PROG-cut=3sG pear 


‘He (was) cutting pears. 


The narrator uses a presentational clause in line 1 to introduce the man and, in 
the second line, uses a topic-comment construction to predicate a property (i.e. 
that he was cutting pears) about that man, an already established referent. The 
subject-topic in line 2 appears as an enclitic on the verb. 

The subject NP, when topical, appears as an enclitic on the verb. In rare cases, 
such as in a transitive clause with a topical object, the subject NP may occur 
as a lexical NP. Invariably, however, like event-reporting constructions, topic- 
comment constructions in ZAI are always verb-initial (except in cases of topi- 
calization or ‘marked’ topics). Therefore, because the verb-initial construction is 
compatible with other pragmatic construals, such as event-reporting or identifi- 
cational constructions, we can consider the verb-initial topic-comment construc- 
tion the unmarked type. I discuss identificational constructions next. 
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6.1.3 Identificational constructions 


Also referred to as an argument focus construction (cf. 85.1.3), an identificational 
construction contains a topical argument and the focus domain is a single con- 
stituent. This focused constituent may occur in the O role, as in (4), a response 
to the question "What did he cut?": 


(4 Q: What did he cut? 
péra  cuchuugube 
pera cu-chuugu' -bel 
pear PROG-cut=3sG 
‘He was cutting PEARS? 


Here, the subject-topic in the A role appears as an enclitic on the verb and the 
focused NP in the O role is placed in pre-verbal position. It is just as acceptable 
and common, however, in the same communicative context, to respond with a 
verb-initial construction with the object in clause-final position, as in (5): 


(5) (Q: What did he cut?) 
cuchuugube péra 
cu-chuugu'-belH pelHra 
PROG-cut=3sG pear 
‘He was cutting PEARS? 


Out of context, the construction in (5) is formally ambiguous between an identi- 
ficational construction and a topic-comment construction. While the verb-initial 
construction can be interpreted as either, the object-initial construction can only 
be interpreted as an identificational construction. 

In identificational constructions, the single focused constituent may also be an 
adjunct. As above, the adjunct may appear clause-initially (6) or clause-finally (7): 


(6) (Q: How did he finish?) 


naguéendá bíluxebé 
na-guee.EndalH pi-Iuxe-belH 
STAT-fast COMPL-finish-3.HUM 


*He finished FAST: 
(7) (Q: How did he finish?) 


biluxebe náguéendá 
bi-luxe=be H na-gueeHndalH 
COMPL-finish-3.HUM STAT-fast 

‘He finished FAST? 
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In (6), the focused constituent is an adverb and appears in pre-verbal position and 
the subject-topic again appears as an enclitic on the verb. In contrast, in (7), the 
subject-topic again appears as an enclitic on the verb but the focused constituent 
appears in clause-final position. 

Finally, the single focused constituent in an identificational construction may 
also be a subject. Again, the focused subject can appear pre-verbally (8) or post- 
verbally (9): 


(8) Q: Who fell? 
badu que  biába 
badu quel! bi-aba 
boy bist ComPL-fall 


‘THE BOY fell’ 


(9) Q: Who fell? 
biaba badu qué 
bi-aba badu que! 
COMPL-fall boy DIST 


“The boy fell’ 


If, however, the subject is coded as a pronominal NP, it may only appear pre- 
verbally as an independent form, as in (10). Unlike dependent pronouns, indepen- 
dent pronouns are always stressed. 


(10) Q: Who fell? 
laabe biába 
laa-belH  bi-aba 
BASE=3.HUM COMPL-fall 
“HE fell? 


The focused subject cannot appear as an enclitic, as shown in (11). 


(11) Q: Who fell? 

#biababé 

bi-aba-belH 

COMPL-fall-3.HUM 

“He fell? 
As an unaccented pronominal form, it is unsurprising that the subject enclitic 
cannot function as a focused constituent. This can be seen in transitive environ- 


ments as well, where focused pronominal subjects in the A role must occur as 
independent pronouns in pre-verbal positions, as in (12): 
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(12) Q: Who cut the pears? 
laabe bíchuugu ca pérá qué 
laa-be.H ^ bi-chuugu' ca pera quel H 
base-3.HUM COMPL-cut PL pear DIST 


‘HE cut the pears. 


The semantically equivalent form with a pronominal subject enclitic is pragmat- 
ically inappropriate in the same context: 


(13) Q: Who cut the pears? 
?bichuugube capérá qué 
bi-chuugu'-belH ca pel Hra quel! 
COMPL-cut=3.HUM PL pear DIST 


‘He cut the pears. 


In transitive constructions with a topical object, the focused subject constituent 
must appear before the verb, as in (14). 


(14) Q: Who cut the pears? 
rígola que  bíchuugu capérá qué 
ri gola que bi-chuugu' ca pera quel E 


man DIST COMPL-cut PLpear DIST 
"THE MAN cut the pears. 


Here, the object-topic appears as a bare NP in post-verbal position and the fo- 
cused subject appears pre-verbally. If the subject appears as a lexical NP in the 
position immediately after the verb, the construction can only be interpreted as 
an event-reporting construction: 


(15) bichuugu rigola que péra qué 
bi-chuugu' ri“ gola quel pera que 
COMPL-cut man DIST pear DIST 


LH 


"Ihe man cut the pears’ 


This construction would not be used as an answer to the question “Who cut 
the pears?". The only way for a lexical NP functioning as a focused subject in 
the A role to appear after the verb would be for the object NP to appear as an 
independent pronominal form, as in (16): 
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(16) Q: Who cut the pears? 
bichuugu rígola que  laácáni 
bi-chuugu' ri gola que!" laa=ca=nil 
COMPL-cut man DIST  BASE-PL-3 


"THE MAN cut them? 


While acceptable, such a construction is not considered common or natural by 
the ZAI speakers with whom I worked and was produced only in elicitation set- 
tings. 

In summary, based on the above discussion, two factors can be observed to 
interact closely in the expression of topic relations in ZAI: constituent order and 
nominal form. Verb-initial clauses are compatible with the widest range of prag- 
matic construals as they can be employed in event-reporting, topic-comment, 
and identificational constructions. Lexical NPs in any of these three construc- 
tion types typically signal a constituent that forms part of the focus domain. In- 
dependent pronominal forms, for their part, may signal topical or focal material, 
depending on position and on context. Meanwhile, dependent forms, i.e. subject 
enclitics, are used exclusively for subject-topics. Pre-verbal constituents, whether 
subjects, objects, or adjuncts, are almost exclusively focused constituents of iden- 
tificational constructions. One exception to this is the topicalization construction, 
which I turn to next. 


6.1.4 Topicalization 


Arguments that appear immediately before the verb form part of the focus do- 
main (86.1.3). This is the case in an identificational construction, where the fo- 
cused constituent can be an object (4), an adjunct (6), or a subject (12). In a topi- 
calization construction, however, a pre-verbal subject is followed by a resumptive 
subject enclitic on the verb, as in the following example: 


(17 laabe bíchuugube péra 
laa-be! bi-chuugu'-bel! pera 
base=3sG coMPL-cut-3sG pear 


‘He cut pears’ 


In contrast to (12) where the pre-verbal pronoun functions as a focused con- 
stituent, here the pronoun in pre-verbal position functions as a subject-topic, 
as signaled by the co-indexed subject clitic. The predicate is a comment on that 
topic. 
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Topicalization constructions typically occur with referents that have already 
been introduced. In the following example, the definite NP in pre-verbal position 
in line 4 refers to an already introduced referent (18): 


(18) (Pear Stories, T: 1.25-27) 

01 huaxa neza ze xcuídi que lá, 
huaxa neza ze xcui di que! laH 
but  pathPART.goboy DIST LA 
“But on the path that the boy went la, 

02 málásí  bídxaagabé tí badudxaapahuiini 
maHlasi bi-dxaagalH-belH ti badudxaapa-huiini 
suddenly coMPr-cross-3sG INDEF girl-DIM 
‘Suddenly he encountered a little girl’ 

03 dxí'ba sti — bícícléta 
dxi'Hba-9Q — stil biciclePta 
PART.climb-3 other bicycle 


“(She was) on another bicycle’ 


04 badudxaapahuiini que — güxha ziña bandá nuu 
badudxaapa-huiini que!" gu-xha=0 ziña banda’! n-uulH 
girl-DIM DIST COMPL-knock=3 palm shade — srAr-be 
iquébé 
ique-belH 
head-3sc 


"Ihe little girl knocked off the hat that was on his head: 


A new participant in the discourse, the bike girl, is introduced in line 2 as an 
indefinite, lexical NP in the O role, ti badudxaapahuiini ‘a little girl”. This refer- 
ent appears again in pre-verbal position in line 4, as a definite NP in pre-verbal 
position, and coincides with a change in subject from the previous clause. This 
is not an identification construction, however, but a topicalization construction 
in which the bike girl is promoted to topic.* 

There are two elements that permit the analysis of this construction as a top- 
icalization construction rather than an identificational one. First, whereas in an 
identificational construction the predicate forms part of the presupposition, here 


>There is, in fact, no difference in formal marking between the zero form and no subject enclitic. 
For this reason, the contrast between the two constructions can only be elicited in discursive 
contexts and then discussed with native speaker consultants who, in my experience, are then 
readily able to recognize the appropriate interpretation. 
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the predicate is a comment on the topic. There is nothing in the context that ties 
the predicate as already part of the discourse. Second, as we saw in the previ- 
ous chapter, the zero third person pronominal enclitic form is commonly used 
by speakers to signal the bike girl as the less thematic participant. This is true in 
this particular narration of the Pear Story as well. In fact, the zero third person 
form was assigned to the bike girl in the previous intonation unit, in line 3. Line 
4 is thus a topic-comment construction about the bike girl. 

The following example further illustrates a similar topicalization construction, 
again from a Pear Story narrative: 


(19) (Pear Stories, M: 1.61-64) 


01 iza'na sombréru que rá nüubé 
gu-iza’na=@ sombre ru que! ra n-uulH=pelH 
COMPL-took-3sc hat DIST LOC STAT-be-3.HUM 
‘(He) took the hat to where he (the boy) was? 

02 laabe bísiga' debe láa chonna  péra 
laa-bel.! ^ bi-si-ga'de-bel! laa=0 ^ chonnal! pelHra 
BASE=3.HUM COMPL-CAUS-give=3.HUM BASE=35G three pear 


‘He (the boy) gave him three pears. 


In line 1, the narrator uses a topic-comment construction to tell how one of the 
three boys, the boy with the paddleball, takes the hat to where the bike boy is. 
The boy with the paddleball functions as the subject-topic and is encoded using 
the zero third person enclitic. In line 2, the bike boy is promoted to topic through 
the topicalization construction. We see the use of the independent pronominal 
form in pre-verbal position which is followed by the resumptive subject enclitic. 
We also see the use of the zero third person form in this line to refer to the boy 
with the paddleball. 


6.1.5 Detached or La-marked constructions 


One final sub-class of topic phrases is found with the particle LA where, similar 
to a topicalization construction, the NP appears before the verb and is co-indexed 
by a subject enclitic on the verb: 


(20) laabe la, cuchuugube péra 
laa-bel ]aH 


base=3sG DEM PROG-cut-3sG pear 


cu-chuugu'-belH pelHra 


‘As for him, he was cutting pears’ 
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Constructions such as that in (20) were addressed briefly above in §3.1.7.2. In con- 
trast to the similar, semantically equivalent constructions in (12) and (17), here 
the NP is set off in a separate intonation unit marked by the particle LA and ac- 
companied by an audible pause. In some contexts such here in (20), La-marked 
phrases have a topic promoting function similar to a topicalization construction. 
In other contexts, however, La-marked phrases can have additional discourse 
functions. What are the main functions of the LA construction, how does it com- 
pare cross-linguistically, and what are its uses in spontaneous conversation? This 
is the focus of the rest of this chapter. 


6.2 Topic relations and the LA particle in discourse 


The LA particle is used widely in ZAI discourse and does not have referential 
meaning, but interacts with constituent order and intonation. It carries a High 
tone and invariably appears at the end of an IU, followed by a pause (never any- 
where else). In this section, I review the range of constructions in which LA oc- 
curs, including adverbial, conditional, and left-detached clauses, and assess its 
possible status as a topic marker. I conclude by exploring and commenting on 
the functions of LA in extended discourse and conversation. 

LA is used consistently in temporal clauses that advance or give information 
about the sequence of events in a narrative, as in (21) and (22): 


(21) (Pear Stories, T: 1.28-29) 


01 óra  bidxiguetalube bíiyabe bádudxaapa 
olHra bi-dxiguetalu-belH ^ bi-uuya-belH badudxaapa 
when COMPL-turn=3SG.ANIM COMPL-see-3sG.ANIM girl 
que lá, 
quel H laH 
DIST LA 
"WWhen he turned and saw that girl la; 

02  bidxelasaa biciclétanebé tí guieroo'ba 
bi-dxela-saa biciclePta-nel-belH ti guie-roo'ba 


COMPL-find-RECIP bicycle-with-3sG.ANIM one stone-AUG 
‘He crashed his bike against the rock’ 
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(22) (Pear Stories, Ts: 1.8-9) 


01 raque má zeeda tí xcuídihuiini lá, 
raquel! ma'H — zeedalH ti xcuidi-huiini la? 
then already PART.come INDEF boy-DIM LA 
“Then as a little boy arrives la’ 

02 biiyabe rá | cuchuugu pérá qué 
bi-iya-be ra  cu-chuugu'-Q pera que"! 


COMPL-see=3SG.ANIM when PROG-pick-3 pear DIST 


‘He saw he (the man) was cutting the pears. 


This use in temporal clauses is extremely common and, despite the fact that 
speakers do not deem it obligatory, it is rare to find cases in spontaneous speech 
in which La is absent.* 

It is also possible to use LA discourse-initially: 


(23) (Lexu ne gueu) 

01 Ni chigüeniá laatu dí lá 
Ni chigüe-nel H-a'H ]aa- tu LH di'H laH 
REL POT.say-with=1SG base-3PL.ANIM DEM LA 
‘This that I will tell you la’ 

02 bizaacani má xadxi 
bi-zaaca-nilH ma'H  xadxi 
COMPL-happen-3sG.INAN already time 


‘it happened some time ago? 


This discourse-initial use of LA has a similar function to the use of LA with tem- 
poral clauses mentioned above as it presents background knowledge or links ele- 
ments of the discourse with the setting. The LA particle also appears consistently 
at the end of the initial phrase of conditionals, as in (24): 


(24) Pa  guiába  nisaguie  guixí la, qué ziaá 
pal?! guiH-aba nisa-guie — guixi'H laH que! ziH-e=a’ 
if | Por-fall water-stone tomorrow LA NEG FUT-go=1sG 


“Tf it rains tomorrow la, I won't go? (Pickett et al. 1998: 109) 


“A tentative hypothesis in this regard may be that this use could be related to the lack of tempo- 
ral or tense information in the verb. ZAI verbs obligatorily take aspectual prefixes, although it 
is an open question to what extent those prefixes convey tense or mood information (cf. 82.3.1). 
More detailed study is required in this direction to determine whether this is the case. 
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Both adverbial and conditional clauses are known to be explicitly marked in other 
languages as well (see Thompson et al. 2007: 292). For example, in Hua (Papuan) 
topics, interrogatives, conditionals are marked with ve (Haiman 1978). In Turk- 
ish, a conditional suffix also marks topics (Kerslake 1996). Such adverbials and 
conditionals are not the only clauses to be marked as topics, as it is extremely 
common to find various types of adverbial clauses functioning as topics. Conces- 
sion, reason, time and condition clauses in Chinese may all occur with the four 
topic/interrogative particles (Thompson et al. 2007: 293). In Godié (Kru (Ivory 
Coast)), a non-final morpheme occurs at the ends of adverbial clauses function- 
ing as topics and single nouns which function as topics may also be similarly 
marked (Marchese 1977; 1987). In Lisu (Tibeto-Burman), adverbial clauses func- 
tioning as topics are marked with the same marker nya which is used for NP 
topics (Ihompson et al. 2007: 294). In Karbi (Tibeto-Burman), the additive par- 
ticle marks contrastive topics (Konnerth 2013). The same is true in Central Kur- 
dish, where the additive particle marks topics as well as temporal, spatial clauses 
(Opengin 2013). 

The question, therefore, is whether we can assume LA is a topic marker. Ac- 
cording to Chafe (1976: 50) (see also Li & Thompson 1976), topics may have the 
following characteristics: a) they appear in sentence-initial position; b) they are 
discourse dependent; c) they need not be arguments of the main predication; d) 
they are definite; and e) they set a "spatial, temporal, or individual framework 
within which the main predication holds.” 

These facts fit with an analysis in which LA is involved in the marking of topical 
information. This does, in fact, appear to be the case, as LA can appear with topical 
NPs, but never with focused initial NPs: 


(25) ¿tu bíni ni? Tomás  ('la) bi'ni ni 
tul! bi-uni nilH Tomals bi-uni ni 
who COMPL-do 3sG.INAN Tomás COMPL-do 3SG.INAN 


"Who did it? Tomás (“la) did it? 


LH 


There are several reasons why it is common for topical adverbial or conditional 
clauses to play this discourse cohesion role. First, background temporal or spa- 
tial clauses may function as a "scene-setting" topic for the matrix clause (Lam- 
brecht 1994: 125). Second, their main function is to link the preceding clause with 
the clause to which they are attached and, at the same time, set a framework 
within which the following predication holds (Thompson et al. 2007: 294). Third, 
they serve to recapitulate already-mentioned material, i.e. to establish common 
ground between interlocutors. Finally, there is often a H pitch that appears on 
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the end of the first intonation unit, then falling on the second. This helps bind 
the information into a couplet structure which allows for interpretation together 
(cf. $5.2; see also Sicoli (2007: 126-127).? 


6.2.1 Left-detachment constructions 


The topic-marking function of LA can be seen in left-detached constructions as 
well. In a left-detached construction, an active or accessible lexical or pronominal 
NP is set off from the matrix clause without a verb by the La particle and a pause, 
and is then taken up again in the following matrix clause by a co-indexed element. 
In (26), line 3, taken from a Pear Story narrative, the narrator uses an independent 
pronoun followed by LA as well as by a pause in the intonation: 


(26) (Pear Stories, Ts: 1.30—33) 

01 biabantaabé 
bi-abantaa-bel 
comPL-fall.hard=3SG.ANIM 
‘He fell? 

02 bireeche dxumi  péra  stibé 
bi-reeche  dxumilH pera st H- peLH 
COMPL-spill basket pear ^ Poss-3sG.ANIM 


‘His basket of pears spilled’ 


03 laabe la, 
laa-bel H ]aH 
BASE=3SG.ANIM LA 
“He la, 

04 biiyadxisibe bádudxaapahuiini qué 
bi-uuyadxisi-be"! ^^ badudxaapa-huiini que! 
COMPL-look-3sG.ANIM girl-DIM DIST 


‘He looked at that little girl’ 


The use of LA at the end of the intonation unit marks the referent of the indepen- 
dent pronoun, the bike boy, as the topic of the subsequent clause. This is also a 
different topic referent than the topic referent of line 2. 

The signaling of a different main-clausal subject (or object), as well as a differ- 
ent topic, from the previous clause is an extremely common use of LA. Below is 
another example, this time from casual conversation: 


In this contrasting and textual cohesion function, the ZAI morpheme appears to have charac- 
teristics similar to the Somali morpheme baa reported in Matié & Wedgwood (2013: 138-140). 
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(27) (20070730 TVA) 
01 xagueté nisa runidxi binni 
xaguete! nisa ru-nidxi binni! 


under water HAB-dive person 


"Under the water people dive’ 
02 ne lú nisa lá, 

neH ju nisa laH 

and face water LA 


‘And above water la; 

03 rixuubacabé 
ri-xuuba'-ca-belH 
HAB-Swim=PL-3SG.ANIM 


“they swim. 


After offering one alternative in line 1 to what people may do under the water, 
the speaker switches the topic in line 2, marked by the use of La, to what people 
may do above water. In this way, the left-detachment construction marked by 
LA is often used to mark a shift in attention from one to another of two or more 
already topical referents. 

To summarize briefly, we have observed thus far that the LA particle serves 
the following two main discourse functions: 1) it consistently appears at the end 
of sentence-initial adverbial clauses and conditionals, i.e. in a frame-setting or 
delimiting function, and 2) it may signal changes in topic or boundaries of topical 
units, i.e. as a contrastive topic marker. In this way, constructions with LA form 
part of the background presuppositions which, as Thompson et al. (2007: 292) 
note, "establish a framework within which to proceed with a discourse, in the 
same way a question does” In fact, all of the constructions involving LA that we 
have reviewed so far share a common morphology with yes/no questions. 


6.2.2 Yes/no questions 


Yes/no questions in ZAI are formed by the addition of a question marker that 
has the exact same form as a sentence-initial adverbial clause or conditional (also 
carries a H tone): 


(28) ,;riuuladxu' Lulá lá? 
ri-yuu-ladxi-lu! — Lula'H laH 
HAB-enter-gut-2sc Oaxaca LA 


“Do you like Oaxaca?” 
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There are three principal reasons to think this is the same morpheme as the dis- 
course particle La. First, as we saw in 82.3, it is uncommon in V-initial languages 
for question particles to occur in clause-final position (Payne 1990). Second, com- 
mon morphology has been found cross-linguistically between interrogatives and 
conditionals (cf. Haiman 1978). Finally, conditional markers are known to consis- 
tently develop out of interrogative particles (Kónig & Siemund 2007: 296). 

A possible reason for the existence of such a connection in ZAI is that the 
LA particle is used by ZAI speakers as a resource in interaction for managing 
the common ground. More specifically, La can be seen as a “try-marking” de- 
vice (Sacks et al. 1974). Sacks et al. (1974) define a “try-marker” as the use of 
an accessible form, with upward intonation contour, followed by a short pause, 
possibly searching for confirmation of the referent from other participants (cf. 
Pekarek Doehler 2011). One way to think about this is to think of sentences that 
are marked with LA as similar to “mini-conversations” (Thompson et al. 2007: 
292). For example, the conditional construction in (24) is semantically similar to 
(29): 


(29) A: ¿chi guiaba nisaguie guixi’ la? Ts it going to rain tomorrow?” 
B: ziaba Tt will? 
A: que ziaá' T won't go. 


Here, Speaker A uses a LA-marked phrase (similar to the protasis in the cor- 
responding conditional construction in (24)) to seek confirmation from B in the 
form of a yes/no response. In this case, B's explicit response provides a shared 
ground within which A can proceed to effectively convey the main propositional 
content (the apodosis in the corresponding conditional construction), i.e. that he 
won't go. 

The conditional construction, therefore, has a very similar interactional func- 
tion, the main difference lying in the lack of an explicit response from an ad- 
dressee after the protasis. It is an open question, however, to what extent ZAI 
speakers do or do not signal degrees of awareness of common ground through 
non-verbal means during conversation, as this varies cross-culturally. This is an 
important question to explore in future work. In both cases, LA is used to mark 
the speaker's turn as a procedure for securing referential common ground with 
the addressee(s). 


$From a usage-based perspective, this analysis suggests the notion of (action and grammatical) 
projection (cf. Auer 2005), in the sense that the use of a LA foreshadows a range of possible 
upcoming actions or constructions. 
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The use of LA with the function of securing referential common ground can 
also be seen in cases in which a speaker is constructing a list. An example is 
given in (30), taken from a casual conversation between three male adults. Here, 
LA is used in lines 2, 4, and 5. 


(30) (20120318 C TVA: 5:44-5:54) 


01 


02 


03 


04 


05 


06 


07 


péru ti dxi ánte 
pelÉruti dxi alte 
but one day before 
“But one day before; 
viérne  huaxhinnique lá 
viel Hrne huaxhinni que!" laH 

Friday evening DEM LA 

‘that Friday evening la’ 

uxudxidú 

gu-xudxi-du! 
COMPL=drink=1PL.EXCL 

‘we got drunk’ 

laabe 1a 

laa=be H ]aH 

base=3SG.ANIM LA 

“Him (pointing) la’ 

Vidal lá 

Vidal la! 

Vidal LA 

"Vidal la’ 

ne  náa 

nel! naa 

and 1sc 

‘and I’ 

bide’du jma caguama 
bi-de’=dul# jma! cagualHma 
COMPL-drink=1PL.EXCL much beer 

“We drank lots of beer. 


The LA particle appears in line 2 at the end of an adverbial clause similar to the 
uses discussed above in (21) and (22). In line 4, the speaker uses the third person 
independent pronoun followed by LA to refer to one of his interlocutors (which 


124 


6.3 Summary and conclusions 


he reiterates by simultaneously pointing). In the immediately following line, line 
5, he refers to yet another third person referent (not a participant) using his 
first name followed by rA. He adds one final referent, himself, in line 6, without 
the use of La. Those three individuals make up a group, established over three 
intonation units, who together function as the subject-topic in line 7 referred 
using the 1Pr.Excr enclitic. In this way, the LA particle is used by the speaker 
to help the addressee identify the individuals in question, i.e. secure common 
ground, prior to the predication (cf. Principle of the Separation of Reference and 
Role, Lambrecht 1994). 

In addition to topic marking and topic promotion, then, the use of LA should 
be seen as a resource for organizing talk and for making that organization rec- 
ognizable to the speech participants. This section has shown that an analysis of 
the multifunctional nature of LA depends on the analysis of spontaneous speech 
and, especially, of conversation. It may be useful to investigate the use of LA as 
a resource in the co-construction of talk, in floor-holding, in turn-taking, in turn 
entry points, etc. and, more generally, as a window into the ways in which lis- 
teners orient to speech and conversation. Because listeners in different speech 
communities may orient in different ways, the relevant question thus becomes: 
how might the use of the LA particle be tied to local conversational strategies and 
conversational norms? From this perspective, it is likely that a characterization 
of LA in terms of notions like topic and focus is insufficient, and that insight into 
its functions can be better understood through an analysis of talk-in-interaction, 
ie. of the kinds of interactional work that are being done in conversation and 
how. 


6.3 Summary and conclusions 


This chapter has presented an analysis of the strategies available to ZAI speakers 
to mark various types of topics and topic relations. It explored the relationship 
between pragmatic or cognitive status and topic-hood and found that it is not a 
pre-requisite, but that topic referents usually have a certain degree of pragmatic 
accessibility, where more acceptable topics are higher on a cognitive status scale 
(i.e., the Topic Accessibility Scale, Lambrecht 1994). Because insufficiently acces- 
sible topic referents are more difficult to interpret, the most acceptable topics in 
ZAI were found to be clitics and the least acceptable to be indefinite NPs and 
bare nouns. 

Two main factors, constituent order and nominal form, were observed to inter- 
act closely in the expression of topic relations in ZAI. Verb-initial clauses are com- 
patible with the widest range of pragmatic construals as they can be employed in 
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event-reporting, topic-comment, and identificational constructions. Lexical NPs 
in any of these three construction types typically signal a constituent that forms 
part of the focus domain. Independent pronominal forms, for their part, may sig- 
nal topical or focal material, depending on position and on context. Meanwhile, 
dependent forms, i.e. subject enclitics, are used exclusively for subject-topics. Pre- 
verbal constituents, whether subjects, objects, or adjuncts, are almost exclusively 
focused constituents of identificational constructions. One exception to this is 
the topicalization construction. In TOPICALIZATION constructions, the pre-verbal 
constituent is a subject-topic with a co-referring enclitic on the verb. These are 
used typically in cases of topic promotion. 

A correlation was identified between information structure and certain types 
of constructions and the cognitive status of the referents involved. For example, 
IN FOCUS (Gundel et al. 1993) or ACTIVATED referents do not occur in presen- 
tational or event-reporting constructions. Also, TYPE IDENTIFIABLE referents do 
not occur in “marked topic”, detachment constructions involving the particle LA. 
Therefore, for ZAI, NPs in presentational constructions are never pronominal 
forms, and NPs in detached, La-marked phrases are never indefinite. 

Itis important to note that the analysis of spontaneous speech and, specifically, 
of conversation makes possible a multifunctional analysis of LA. Through this 
analysis, we saw too that LA-marked constructions can have a topic-promoting 
function, but also mark topical information, set the spatial, temporal, or individ- 
ual framework within which the predication holds, and play a discourse cohesion 
role. They mark phrases that function as "scene-setting topics" that have a frame- 
setting or delimiting function. LA-marked constructions also mark contrastive 
topics, indicating changes in topics or boundaries of topical units. 

Furthermore, constructions with LA form part of the background presupposi- 
tions, and establish a framework within which to proceed with the discourse, in 
the same way a question does. LA is, in fact, used in yes/no questions to secure 
referential common ground with the addressee(s). As such, LA can be seen not 
only as a resource for marking various types of topical information, but more 
generally as a resource for organizing talk and interaction. 
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research 


The fundamental aim of information structure studies, and of discourse pragmat- 
ics more generally, is to understand how the same propositional content can be 
expressed in linguistically different ways. In this, it is important to examine the 
syntagmatic relations between the elements of a clause or sentence and the ways 
that these can vary. More crucially, however, the study of information structure 
requires an analysis of the paradigmatic relations between different, but related 
clause or sentence structures. These structures, as they are stored in the memory 
of speakers and hearers, represent alternative ways to structure propositions that 
differ depending on the pragmatic goals of the speaker. In other words, the study 
of information structure involves not only the relationships and orders between 
elements within a clause or sentence, but also the relationships between clauses 
or sentences that are semantically equivalent though formally and pragmatically 
different. These relationships are the paradigmatic relations that hold between 
available alternatives and that speakers and hearers bring to bear to accomplish 
their communicative goals. 

This study examined the paradigmatic relations that hold in ZAI between dif- 
ferent structures on two distinct levels: a) the pragmatic states of the referents 
of individual sentence constituents in the minds of the speech participants, and 
b) the pragmatic relations established between these referents and propositions. 
First, as we saw in Chapters 3 and 4, speakers use the relationships between 
nominal forms, cognitive statuses, and grammatical roles in nuanced ways to ac- 
complish specific communicative and interactional goals, such as to 1) introduce 
and track referents, 2) mark referents as more or less accessible, and 3) mark cer- 
tain referents as more or less thematic. Second, as we saw in Chapters 5 and 6, 
speakers exploit the relations between constituent orders, morphology, and top- 
ical and focal material to 1) distinguish between presuppositions and assertions, 
2) mark shifts of background information or of topical units, 3) signal the focus 
domain of a proposition, and 4) to accomplish interactional goals such as holding 
or ceding the floor in turn-taking in conversation. 


7 Conclusions and avenues for further research 


With these two directions in mind, this chapter presents an overview of the 
main contributions of this study. In this, I discuss the conclusions derived from 
the analysis of the main information structure properties of ZAI, namely: 1) nom- 
inal forms and cognitive status, 2) the LA particle, and 3) topic and focus construc- 
tions. This discussion includes the conclusions reached in the analysis of the use 
of each of these three properties in narrative and conversation including: the al- 
ternation between overt and zero third-person pronominal clitics, the use of the 
particle La, and the parallel, chiastic use of predicate focus and argument focus. 
Included in each section is a discussion of possible avenues for further research. 


7.1 Nominal forms and cognitive status 


This study explored the relationship between form and distribution of nominals 
and between their form and function, analyzing the different forms that are used 
to introduce and track referents and to mark referents as more or less accessible. 
The discussion, framed between Preferred Argument Structure (Du Bois et al. 
2003) and the theory of Accessibility (Ariel 2001), showed that the fundamental 
mechanism driving the tendencies captured by PAS can be traced to the notion 
of accessibility. 

More specifically, the avoidance of new referents and lexical NPs in the A role 
was understood as an avoidance of referents in the A role with a low degree of 
accessibility. The tendency, in other words, is to avoid low accessible As. The result 
is that highly accessible referents with less coding material are likely to occur in 
the A role. In contrast, low accessible referents with more coding material are 
unlikely to occur in that role and, instead, will more consistently occur in the O 
role. The S role exhibits a tendency in between the A and O roles in that it will 
often house previously mentioned, animate, salient, topical, and recent referents. 
At the same time, however, it will often function as a “cognitive staging area” for 
the introduction of new referents at episode boundaries. 

Moreover, because nominal forms indicate the status of their denotations as 
pragmatically more or less available in the speaker or hearer's mind, the forms 
of nominals that speakers use depend on the assumed cognitive status of the 
referents involved. That is, they depend on assumptions that a speaker can rea- 
sonably make regarding the addressee's knowledge and attention state in the 
specific context in which the form is used. Therefore, not only does type of nomi- 
nal expression correlate with grammatical role, but with cognitive status as well. 

It is important to note that pragmatic or cognitive status is not a pre-requisite 
for topic or focus-hood, although it may play a role. Because insufficiently acces- 
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sible topic referents are more difficult to interpret, topic referents usually have 
a certain degree of pragmatic accessibility, where more acceptable topics are 
higher on a cognitive status scale (i.e., the Topic Accessibility Scale, Lambrecht 
1994). The least acceptable are indefinite NPs and bare nouns. The most accept- 
able topics in ZAI are clitics. Related to this, it was observed that the inanimate 
object enclitic, although inconsistent, is employed relatively frequently for top- 
ics (cf. example (20). One goal of future work should be to pay close attention 
to this use. 

Correlations were also found between information structure of certain types of 
constructions and the cognitive status of the referents involved. IN Focus (Gun- 
del et al. 1993) or ACTIVATED referents do not occur in presentational or event- 
reporting constructions. TYPE IDENTIFIABLE referents do not occur in "marked 
topic”, detachment constructional involving the particle La. Therefore, for ZAI, 
NPs in presentational constructions are never pronominal forms, and NPs in de- 
tached, LA-marked phrases are never indefinite. Presentational constructions are 
often used to introduce new, human referents, but new referents, either human 
or not human, can also be introduced in the O role using topic-comment con- 
structions. 

Chapter 4 focused on the pragmatic status of the two third person pronominal 
forms, the zero and the overt subject enclitic form, exploring the distribution and 
alternation of these forms in narrative and conversation. While the overt form 
was found to have a broader set of binding conditions than the zero form, the 
choice between the two forms is free at the main clause level. In those cases, an 
important discursive factor governing their use is the relative thematic salience 
of the referents. Because the overt pronoun is used for more thematic figures and 
the zero for less thematic figures, speakers must make active choices in contexts 
involving multiple third-person participants about which pronoun to assign to 
each. The study of narrative and conversational contexts is therefore crucial for 
understanding how speakers and hearers evaluate the relative thematicity of par- 
ticipants and use linguistic resources to do so. 


7.2 Topic and focus constructions 


At the center of information structure in ZAl is the flexible nature of constituent 
order. As we saw, the extent to which phonetic and intonational cues play a role 
in the expression of the cognitive status of referents was found to be minimal, 
and information structure categories and relations are expressed mainly through 
manipulation of constituent order. 
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Verb-initial clauses are compatible with the widest range of pragmatic constru- 
als as they can be employed in all topic-focus construction types: event-reporting, 
topic-comment, and identificational constructions. Constituent order, however, 
adapts to discourse functions, and verb-initial syntax in ZAI is frequently vio- 
lated in constructions in which topicalized and focalized elements may often ap- 
pear before the verb. For this reason, we described ZAI as syntactically relatively 
flexible. In addition, because the focus domain is mostly tied to the pre-verbal 
position, ZAI can be described as pragmatically relatively rigid. Pre-verbal con- 
stituents, whether subjects, objects, or adjuncts, are almost exclusively focused 
constituents of identificational constructions.! 

Therefore, focus structure in ZAI may motivate certain syntactic arrangements. 
The reverse, that syntactic arrangements motivate changes in the focus domain, 
is never the case. 

Moreover, constituent order interacts closely with nominal form in the expres- 
sion of topic and focus relations in ZAI. Lexical NPs in any construction type 
typically signal a constituent that forms part of the focus domain. Independent 
pronominal forms, for their part, may signal topical or focal material, depend- 
ing on position or context. Meanwhile, dependent forms, i.e. subject enclitics, 
are used exclusively for subject-topics. A focused subject cannot appear as an 
enclitic on the verb. 

Finally, it was noted that both verb-initial and non-verb-initial structures ex- 
ploit positions of prosodic prominence at the beginning and end of IUs. As we 
saw through an analysis of the use of different focus structure constructions in 
narrative and conversation, these positions are exploited in the parallel, chiastic 
use of predicate focus and argument focus. 

In this sense, while there is no evidence for pitch accents associated with top- 
ical or focal material, it is possible that there may be a prosodic motivation for 
the various types of constituent orders and for the pragmatic motivations un- 
derlying their use. The search for description and explanation in this dimension 
would benefit greatly from a detailed, systematic study of the range of intonation 
patterns employed by ZAI speakers and their relation to the diversity of informa- 
tion structure categories and constructions. Ideally, this study could be extended 
or related to similar phenomena in related Zapotec languages. 


¡One exception to this is the topicalization construction, in which the pre-verbal constituent 
is a subject-topic with a co-referring enclitic on the verb. These are used typically in cases of 
topic promotion. 


130 


73 Theta discourse particle 


7.3 The LA discourse particle 


The discourse particle LA is involved in expressing information structure in ZAI. 
As we saw in Chapter 6, LA-marked constructions can have a topic-promoting 
function, but also mark topical information, set the spatial, temporal, or individ- 
ual framework within which the predication holds, and play a discourse cohesion 
role. They mark phrases that function as "scene-setting topics" can have a frame- 
setting or delimiting function, mark changes in topic or boundaries of topical 
units, and/or function as contrastive topic markers. 

More generally, constructions with LA form part of the background presuppo- 
sitions and establish a framework within which to proceed with the discourse, 
in much the same way that a question does. As was pointed out, there are, in 
fact, similarities between the use of LA in yes/no questions and in LA-marked 
or detached phrases in that both are used to secure referential common ground 
with the addressee(s). From this perspective, LA functions as a try-marker and as 
a resource for negotiating common ground. 

As with the analysis of the overt versus zero alternation in third person pro- 
nominal forms, the multifunctional analysis of LA also requires the analysis of 
spontaneous speech and, specifically, of conversation. It is likely that the use 
of LA is tied to the ways that ZAI speakers signal degrees of awareness of com- 
mon ground in interaction through not only linguistic means but also non-verbal 
means. An analysis of multi-modal interaction would no doubt be extremely 
worthwhile to begin to understand how forms such as this are employed and 
how they fit into local conversational norms about the kinds of assumptions that 
are made explicit linguistically between speakers and hearers and which are not. 

Because listeners in different speech communities can orient themselves in 
different ways, the following question is posed: How can the use of the particle 
be linked to local conversational strategies and norms? From this perspective, 
probably a characterization of LA, as well as a more general characterization 
of the focal structure of the ZAI in terms of notions such as topic and focus 
is insufficient (see Matié & Wedgwood 2013; Ozerov 2015). Instead, it is likely 
that the uses of the focal structure will be better understood through an analysis 
of the interaction; that is, through an analysis of the types of interactions that 
participants are having in the conversation and why. 
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N: 01 ¿randa guini’lu xi biiyalu? 
randalM!  guil-ni'-luxi ^ bi-iya-lu 


2sc-be.able Por-say-2sc what COMPL-see-2sG 


“Can you tell what you saw?” 


T: 02 zandá pue 
LH. !H pues 


FUT-can=1sG well 


z-anda 


“Well, I can’ 


N:03 ¿xi biiyalu? 
xi bi-iya=lu 
what COMPL-see-2sG 


"What did you see?' 
T: 04 bihuiini lu ni lá 
bi-huiini lu nilH ]aH 


COMPL-appear face 3SG.INAN LA 


“There appears; 


05 ti rígola cuchuugu caadxi  cuánanaxhi 
ti riPlgola cuu=chuugu”  caadxil!! cuananaxhi 
one man  . PROG.CAUS-Cut few fruit 


“a man cutting some fruit’ 


06 rígola que lá 
riHgola quel! laH 


man DEM LA 


‘that man; 


Appendix A 


07 má bichabe chüpá | dxümí ní bíchuugubé 
ma'H  bi-cha-belH chupa!" dxumilH ni bi-chuugu-belE 
already COMPL.CAUS=fill=3.HUM two basket REL COMPL-cut=3.HUM 


08 


09 


10 


11 


12 


13 
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“he had already filled two baskets of pears that he cut' 


raque  cúchabe guíra péra  cuchugubé 
raquel! c.u=cha=be H guiralH pelHra cu-chugu-belH 
then  PROG.CAUS=put.in=3.HUM all pear PROG=cut=3.HUM 


‘then he was putting in all the pears he was cutting’ 


dxí'babe lú yaga qué 

dxi'H ba-belH]u yaga quel 

climb=3.HUM face tree DIST 

“(he was) up in that tree’ 

qué fiannadíbé bédanda tí xcuidihuiini 
que! iüa-nnalH-di-pelH be-dandal! ti xcuildi-huiini 


NEG IRR-know-EMPH-3.HUM COMPL=arrive.there one boy-DIM 


“he didn't know a boy arrived there” 


dxí'ba ti bicicléta 
dxiHba-Q ti bicicleta 
PART.climb-3 one bicycle 


“(he was) on a bicycle' 


gucaa ti dxumi  péra qué 
gu-caa-O ti dxumilH pelra que 
COMPL=put=3 one basket pear DIST 


LH 


‘(he) put that basket of pears’ 


bidxi’ba lu xpicicléta 
bi-dxi'Hba-O ^ lu x=bicicle™ta=@ 
COMPL-climb-3sc face poss=bicycle=3 


‘(he) got on his bicycle’ 


14 


15 


16 


17 


18 


19 


20 


ne  bíree ze 


nel! bi=ree=0 zel ig 

and COMPL=leave=3 PART.go=3 

“and (he) left 

gula'na xcuídi que  dxümí  péra  stibé 


gu-lana  xcui'di que! dxumilH pera sti H- pel H 


coMPL-stealboy DEM basket pear  Poss-3.HUM 


“that boy stole his basket of pears' 


huaxa neza ze xcuídi que lá 
huaxa neza ze xcuildi que! laH 


but path PART.go boy DIST LA 


‘but on the path that the boy went; 


málásí bídxaagabé tí badudxaapahuiini 
maHlasi!H þi-dxaagaH=be!" ti badudxaapa-huiini 
suddenly coMPr-cross-3sG INDEF girl-DIM 


‘suddenly he crossed a little girl’ 


dxí'ba sti — bicicléta 
dxiHba-Q stilH bicicleta 
PART.climb-3 other bicycle 


“(she was) on another bicycle’ 


badudxaapahuiini que — güxha zia banda nuu íquébé 
badudxaapa-huiini que! gu-xha=0 ziña banda’! n-uulH ique-belH 
girl-pim DIST COMPL-knock=3 palmshade — srAT-be head-3sG 


'the little girl knocked the hat that was on his head' 


óra  bidxiguetalube bíiyabe bádudxaapa que 
olHra bi-dxiguetalu-belH ^ bi-uuya-belE badudxaapa quel HH 
when COMPL-turn=3SG.ANIM COMPL-see=3SG.ANIM girl DIST 

lá 

]aH 

LA 


‘when he turned and saw that girl’ 
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21 bidxelasaa biciclétanebé tí guieroo' ba 
bi-dxela-saa biciclePta-neH-belH ti guie-roo’ba 
COMPL-find-RECIP bicycle-with-3sG.ANIM one stone-AUG 


‘he crashed his bike against the rock’ 


22 biabantaabé 
bi-abantaa-belH 
COMPL-fall.hard-3sc.ANIM 


‘he fell’ 


23 bireeche  dxumi  péra  stibé 
bi-creeche | dxumi!! pera st HH pel 
COMPL-spill basket pear  POSS-3SG.ANIM 


‘his basket of pears spilled’ 


24 laabe la 
laa-belH la 
BASE=3SG.ANIM LA 
‘he, 

25 biiyadxisibé bádudxaapahuiini qué 
bi-uuyadxisiH=be H badudxaapa-huiini que! 
COMPL-see.fixedly=3sG.ANIM girl-DIM DEM 


“he looked at that little girl? 


26 raque lá 


raquel! laH 
LOC-DIST LA 
‘then’ 

27 mála ze chonna  xcuídihuiini 
malla ze chonnal xcuidi-huiini 
suddenly FUT.go three kid-pIM 


‘suddenly three little kids’ 
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28 


29 


30 


31 


32 


33 


34 


badunguiiuhuiini laacá 


badunguiiu-huiini laacab#! 

boy-DIM also 

little boys also” 

gucanecá laabe bídopa guírá  pérá 
gu-ca-nel-ca-o laa-bel! bi-dopalH guiralH pelH 
COMPL-help-with-PL-3sG BASE-3sG COMPL-pick.up all pear 


“(they) helped him pick up all the pears' 


bichaacani ní dxümi 


bi-chaa=ca-nilH ni dxumi 


COMPL-put.in=PL-3SG.INAM LOC basket 


LH 


‘they were put in the basket’ 


ne  bídxi' babe ní biciclétá  stíbé 
nel! bi-dxi'Hba-belH ni bicicleta sti Ha pel H 
and compPL-climb=3sG Loc bicycle — Poss-3sc 


‘and he got on his bicycle’ 


zizabé 
z-iza=belH 


PROG-walk=3sG 


“and went walking' 
guiónna' badunguiuuhuiini que lá, 


guio"nna' badu-nguiiu-huiini quel laH 
third child-man-Dim DIST LA 


“those three boys; 


gudí dica, 
gu-di'Hdi-ca-9 
COMPL-cross=PL-3 


“(they) crossed, 


qué 
ra quelH 


DIST 
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35 zéca 


36 


37 


38 


39 


40 


4 
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m. 


zeH =ca-Ø 
PROG.gO=PL=3 


“(they) were leaving’ 


óra  biiyaca nexhe ziña bandá 


olHra bi-iya=ca-@ nexhe ziña banda'H 
when COMPL-see=PL-3 lying palm shade 
lá 

laH 

LA 


‘when they saw his hat lying on that path’ 


gundisácá ni 
gu-ndisa'H -ca-Q nil H 


COMPL-lift-PL-3 3sG.INAM 
(they) picked it up’ 
ne  bíbiguetaca 


ne! bi-bigueta=ca-D 
and COMPL-return=PL-3 


“and went back’ 


bicaca stiipi laabé 
bi-ca=ca-D stii-HpilH laa=be H 
COMPL-put=PL-3 whistle BASE=3SG 


“(they) whistled to him’ 


ne  güyeca ra nuubé 
nell gu-ye-ca-O ra n-uu-belH 
and COMPL-go-PL-3 LOC STAT-be-3sc 


“and (they) went to where he was; 

bidiica ziña bandá?  stibé 
bi-dii=ca-Ø ziña banda’! sti Hz pel H 
COMPL-give-PL-3 palm shade — Poss-3sc 


(they) gave him his hat’ 


stibé 


lú neza que 


stiH=belH lu neza quel 


POSS=3SG 


face path DIST 


H 


42 


43 


44 


45 


46 


47 


48 


laabe óraque lá 


laa=be™ olHraquelH laH 

BASE-3sG then LA 

“then he; 

gucuabe chónná  péra 
gu-cua=belH chonnalH pera 
COMPL-choose-3sc three pear 


“he chose three pears' 

bidiibe cába'duque né  bíreébé 
bi-dii-be- ca ba'du quel! nelH pi-ree- pel E 
COMPL-give-3sc PL child pisr and compt-leave=3sc 


‘he gave those kids and he left’ 
zinébé xpíciclétábé 
ziH-nelH-pelH ^ x-bicicleta=bel# 
PROG.go-with=3sG Poss-bicycle-3sc 


“he went with his bicycle' 


caba'duque lá 
ca ba'du quel! aH 


PL child DIST LA 


‘those children La’ 
gudi dica neza 


gu-di'di-ca-2 neza 
COMPL-pass-PL-3sG path 


(they) crossed along the path’ 
zéca 


zeH-ca-Q 
PROG.g0-PL-3 


(they) left’ 
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49 


50 


51 


52 


53 


54 


55 
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gucuaca tipéra cada tobi ca 
gu-cua-ca-Q ti pera cada tobi ca 
COMPL-choose-PL-3 a pear each one DET 


“(they) chose a pear each' 


yendaca ra nuu dxa yaga péra 
gu-yenda-ca-O ra n-uu dxa yaga pera 
COMPL-go=PL-3 LOC STAT-be full tree pear 


(they) went to where the full tree of pears was’ 


ra dxíba dxarígola que 
ra dxi'Hba dxa ri? gola quel 
LOC climb full old.man pisT 


H 


"where the man was up on' 


rígola que lá 
riHgola quel! aH 


old.man DIST LA 


‘that man’ 

óraquepe má biete de lu yaga qué 
olHraquepe ma! bi-ete=Ø de lu yaga que!" 
when already COMPL-go.down=3sG from face tree DIST 


“when (he) came down from that tree’ 


lu tiyaga cue ni 
lu tiyaga cue’ ni 
face a tree side 3sG.INAN 


LH 


*on the side of the trunk of the tree' 


raque  bíete 
LH piete- 
raque iete=D 
then — coMPr-go.down-3sc 


‘then (he) came down’ 
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57 


58 


59 


60 


61 


62 


óra  biiya lá 
ora bi-iya=0 laH 
when COMPL-see-3sG LA 


“when (he) saw? 
cayaadxa tidxumi  pérá sti 


ca-yaadxa' ti dxumil! pera sti4=o 
PROG-miss a basket pear POSS=3SG 


“a basket of his pears was missing’ 


que  gánna tu la gucua ni ni 
quel! g-anna-Q tu! lal! gu-cua nil nilH 
NEG  POT-know who name coMPL-grab 3 3SG.INAN 


‘he didn't know who grabbed it’ 


biiyadxisibe guiónna' badunguiuhuiini qué 
bi-iyadxisi-H - pel guio"nna' badunguiiu-huiini que! 
COMPL-see.fixedly-only-3sc third boy-DIM DIST 


“he looked fixedly at those three little kids' 


óra  gudí' dica ra nuubé 
olHra gu-di'Hdi-ca-D ra n-uu-belH 
when COMPL-pass=PL-3 LOC STAT-be-3sc 


“when (they) passed by where he was' 


ne  [guza-] gudi’dica 
nel! [guza-] gu-di'Hdi-ca-9 


and COMPL-pass=PL-3 
“and (they) passed” 
zéca ti neza qué 


zelH=ca-Ø  tineza quel H 
PROG.gO=PL-3 a path DIST 


‘(they) went on that path’ 
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63 laabe qué fiannabe tú lá gucua dxumi  pérá 
laa-belH que ü-anna-bel! tu la gucua dxumil4 pera 
BASE=3SG NEG IRR-know=3sG who name COMPL-pick basket ^ pear 
stibé 
sti H -pelH 
POSS=3SG 


‘he would not know who took his basket of pears’ 
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T: 001 


002 


003 


004 


005 


006 


dxi que  nalasébé 
dxi que“ nalase'H-be 
day DEM  thin-3sG.HUM 


H 


*Back then he was thin' 


laabe lá 
laa-bel.H ]aH 
BASE=3SG.HUM LA 


“as for him' 
ma biiyabe 


ma  bi-iya-belH 

already COMPL-see=3SG.HUM 
“he already saw” 

bia' 

bia' 

about 


‘about’ 


bia’ nalasébé 
bia  na-lase'H-bebelH 
about sTAT-thin=3sG.HUM 


‘he was pretty thin’ 


nalasébé 
na-lase'H pel 


STAT-thin-3sG.HUM 


‘he was thin’ 
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007 nabé nalasébé 


nabe! na-lase H=belH 


very STAT-thin=3SG.HUM 


“he was very thin' 


M: 008 dxi que nua México mecánico  laabé 
dxi quel! n-uubH=4'H México meca nico laa-bel 


day DIST  STAT-to.be-1sc Mexico mechanic  BASE=3SG.HUM 


“Back then I was a mechanic in Mexico City” 


009 xcuidihuiini xa 
xcuidi-huiini' xa 
child-pim INTJ 


‘a child’ 


010 muchachuhuiini' 
muchachu-huiini’ 
young.man-DIM 
“a young man’ 

011 dxi bixooñé ja 
dxi bixooñe' ja 
day compt-run-a’! INT] 
“when I ran, huh' 


012 maratón internacionalque lá 
maratón internacional que! laH 

marathon international DIST LA 
“the international marathon; 

T: 013 aja 
aja 
yeah 
‘Yeah’ 
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M: 014 ¿xi lani? 
xi  la-nilH 


what name-3sG.INAN 


"What was it called?” 


015 má nápá veintidós iza 
ma'H  na-apa-a'H 


already srAT-have-1sc twenty-two year 


veintidos! iza 


‘I was already 22 years old’ 


016 veintidós iza napa dxi qué 


H H 


veintidos! iza na-apa-a' dxi quel 


twenty-two year sTAT-have=1sG day DIST 


‘I was 22 years old then’ 


017 lu nověnta-y-dos 
lu nověnta-y-dos 
PP ninety-two 


“in £92’ 


018 lu iza nověnta-y-dos 
lu iza novénta-y-dos 
PP year ninety-two 


“in the year ‘92’ 


019 pěro nagasi má nuunu dós mil dóce 
pero nagasi H ma! n-wu=nul# dos mil dolce 
but now already sTAT-be=1PL.INCL two thousand twelve 


‘but now it’s already 2012’ 


T:020 ¿ma panda iza? 
ma'H  pandalH iza 
already how.many year 


'How many years ago?' 
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M: 021 gandé 
gande!H 
twenty 


“Twenty” 


T: 022 ma bia’ gande  íza 
ma bia  gande"" iza 
already about twenty year 


"Already about twenty years' 


M: 023 má raca gande  íza 
ma raca gande!” iza 


already HAB-occur twenty year 


‘It’s already been twenty years’ 


T:024 ¿pabiá ti lidxi que ya? 
pabiaH ti lidxi que" ya 
how.much one house DIST Q 


“How much did a house cost?” 


M: 025 bia  nasoolo namás que jmá nalasé xa 
bia’ na-soo-lu namas! que jma! na-lase'H-a'H xa 
about sTAT-tall=25G only that more srAT-thin-1sG INTJ 


‘I was about the same height, I was just thinner’ 


026 nalasébé biá naa 
na-lase'H-pelH bia’ naa 
STAT-thin=3sG.HUM about 1sG 


‘he was thin like me’ 


T: 027 ¿panda  íza- 
pandalH iza 
how.many year 


€ > 
How many years 
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028 


M: 029 


030 


T: 031 


032 


033 


M: 034 


¿panda este kílómetro  bixooñelu raqué? 
panda! este kilolHmetro bi-xoofie=lu’ raquel 


how.many INTJ kilometer | coMPL-run-2sc then 


'How many, um, kilometers did you run then?' 


cuarénta-y-dos 
cuare" nta-y-dos 
forty-two 


‘Forty-two’ 


cuarénta-y-dos kilómetro 
cuare nta-y-dos kilo! metro 
forty-two kilometer 


‘forty-two kilometers’ 


chupa chónná gubidxa zeedandarú dxi guxooñelu qué 
chupa!" chonnaH gubidxa z-eedandaH-ru dxi gu-xooñel=u' que! 
two three sun PART=arrive=still day POT-run=2sG DIST 


"Two or three days would pass while you'd be running’ 


¿binu xiixa éjércicio 1a? 
bi-i'ni-lu' xii HxalH ejerciMcio la" 


COMPL-do-2sG something exercise LA 


“Did you do some exercise?” 


¿o laaca casi biasalu lu cama zuxoofielu? 
o laaca casi bi-asa-lu' lu calHima zu-xoofie-lu' 
or same as COMPL-get.up=2SG PP FUT-run-2sG 


“Or just as you got out of bed you went to run? 


pues normál xa 
pues normal" xa 
well normal INTJ 


“Well, normal 
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035 ejercício ira dxí 
ejercicio guira’ dxi 
PL exercise all 


T: 036 


M: 037 


038 


039 


T: 040 


04 
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m. 


‘I did the exercises every day” 
jmalHcá lá? 

malH cal! laH 

really LA 

‘Really?’ 


naa siémpre uxóo ne’ 
naa siempre guH-xoo ne=a’ 
1sG always POT-run=1sG 


‘I would always run’ 


puro de chii kilómetro 
puro de chii kilometro 
all of ten kilometer 


“all ten kilometers” 


xhono kilómetro 
xhono kilo metro 
eight kilometer 


‘eight kilometers’ 

¿pabiá uxoóñelu ira dxi ya? 
pabia'H ^ gulH xooñe=lu guira’! dxi ya? 
hom.much Por-crun-2sc all day Q 


'How much would you run every day?' 


¿chii kilómetro ti dxi ruxoofio la? 
chii kilometro ti dxi ru-xooiie-lu' la! 
ten kilometer one day HAB-run-2sG LA 


"You would run ten kilometers a day?' 


M: 042 


T: 043 


M: 044 


045 


046 


047 


048 


siadosi 
siaHdo' - si H 
morning-only 


‘Just in the morning’ 


ya aja 

ya aja 

ok yes 

“Ok, yes” 

guxoone jaa 

gulH xooñe=a' Pl jaa 
POT-run=1sG INTJ 

‘I would run, huh’ 

pa  xhónó  kílómentro lá 
pal xhonolH kilometro laH 


if eight kilometer rA 


'either eight kilometers' 


o chii kilometro 
o chii kilo“! metro 
or ten kilometer 


*or ten kilometers' 


dedea la cínco dela mafiána lá 
dedea la ciÉnco de la mañalHna laH 


PP atthefive  PPthemorning LA 


‘from five in the morning’ 


hasta las séis-y-media dela mañána 


hasta las seisH-y-media de la mañana 


PP thesix-thirty PP the morning 


‘until six-thirty in the morning’ 
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049 


050 


T: 051 


M: 052 


T: 053 


M: 054 


057 


150 


párque Tezozómoc este 
parHque Tezozo-H 
park Tezozomoc INTJ 


moc este 


"Tezozomoc Park, um; 


delegación Azcapotzálco dela tabácalera bučno 
delegacion Azcapotzalco dela tabalHcalera, bueno 
district ^ Azcatpotzalco PP the Tabacalera well 


“Azcapotzalco District in the Tabacalera [neighborhood], well’ 


ah gaxha dera panteón este 
ah gaxha dera panteon! este 
INTJ close PP LOC mausoleum INT] 


“Ah, close to the mausoleum, um' 


panteón, este, panteón San Isidro 
panteonH este panteonH San Isi! dro 


mausoleum INT] mausoleum San Isidro 
*Mausoleum, um, San Isidro mausoleum' 
cádi zitu ndi 

calHdi zitu ndi HH 

NEG far DEM 


‘It’s not far’ 

cádi zitu ndi 
caldi zitu ndi LH 
NEG far DEM 


‘It’s not far’ 


gaxha dera métro este Rosário 
gaxha de ra me'tro este Rosario 


close PP LOC metro INT] Rosario 


“close to the Rosario metro [station]' 


T: 058 


M: 059 


060 


061 


062 


T: 063 


M: 064 


mápe nga  zítu nuú métro Rosário 


LH LH, 


ma’pe ngal! zitu n-uu metro Rosario 


already NGA far srAT-to.be metro Rosario 


‘It’s far from the Rosario metro [station]’ 


ya, métro Rosário lá 


LH 


ya metro RosalHrio la! 


INT] metro Rosario LA 


‘Rosario Metro’ 


rari’ 
rari LH 


here 


‘is here’ (gestures with right hand) 


ne  pánteón SanIsidro  cheri’ 
nel! panteon! San Isi!H dro cheri’! 
and mausoleum San Isidro here 


H 


“and San Isidro mausoleum is here” (gestures with left hand) 
bia' 

bia' 

about 


*around here' 


¿raque bíxoo nelu panda kilómetro? 
raquel" bi-xoofie-lu! panda!" kilo! 


then COMPL-run-2sG kilometer 


metro? 


“Then you ran how many kilometers?” 


co, raque rari guné’ entrenár 


LH H 


co raquel! rari LH. gu-ini-a' s 


entrenar 
no then here  ComPL-do=1sG train 


*No, I trained here' 
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T: 065 ya 
ya 


066 ¿bixooñelu raque panda? 
bi-xoofie-lu! raquel" panda!" 
coMPL-run=2sG then how.many 


‘You ran how many?’ 


M: 067 chupa kilómetro napani álrededór 
chupa!" kilo-Hmetro na-apa-ni-! alrededor" 
two kilometer  sTAT-have=3SG.INAN around 


“It is two kilometers around’ 


T: 068 ¿panda  buélta? 
panda! buelta? 
how.many lap 


'How many laps?' 


M: 069 pue udieé ní tápa buélta nga 
pue gu-diee-a'H ni tapa buelta nga! 
well cCoMPL-give=1SG 3sc.INAN four lap NGA 


"Well, four laps is eight kilometers' 


070 udieé ní gáayu lá 
gu-diee-a'M pit gaayu' laH 


COMPL-give=1sG 3sG.INAN five LA 
'five laps; 


071 chii kilómetro xa 
chii kilo H metro xa 
ten kilometer INTJ 


‘ten kilometers 
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xhóno kilómetro 


LH kil oLH 


xhono metro 


eight ^ kilometer 


072 


073 


T: 074 


075 


M: 076 


077 


078 


pa  üdieé ní xhóopá lá 
pal! gu-diee-a'H ^ gjlH xhoopa' la! 
if | COMPL-give=1sG 3SG.INAN six LA 


^f six 


nga  dóce kilómetro 
ngal! doHce kilo-H metro 
NGA twelve kilometer 


‘that is twelve kilometers’ 


yannadxi bixooñelu este lu maratón qué lá 
yanna-dxi bi-xoofie-lu' este lu maratonH que 
now-day COMPL-run=2SG INTJ PP marathon DIST LA 


‘Now, when you, um, ran the marathon,” 


¿panda kilómetro? 
pandalH kilometro? 
how.many kilometer 


“how many kilometers?” 


cuarénta-y-dos 
cuareÉnta-y-dos 

forty-two 

‘Forty-two’ 

dxi guuya qué la 
dxi gu-H-uuya-a'H que! laH 
day POT-see=1SG DIST LA 


“when I saw that; 


ucaa diaaga 
gu-caa diaaga 
COMPL-put ear 


listen’ 


LH ]aH 
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079 


080 


081 


082 


083 


084 


085 


154 


nacúbi jaa unuá' México 
nacul'Hbijaa gu-unulH-a'H Mexico 
new INTJ COMPL-travel=1sG Mexico 


T had just travelled to Mexico’ 


como raque  uyuu Vidál jmá huaniisi 
como raquel! gu-yuu Vidal" jmaH huaniisil 


as then  compti-go Vidal Por.go-1sc already 


*because then Vidal went, he was older 


que bi’nibe dxíi na nabé 
quel! bi-i'ni-bel dxiiña na-belH 
NEG COMPL-do-3sc.HUM work say=3SG.HUM 


“he didn't work, he says' 


uye Tomás yeganna  láadü 
gu-e Tomas! yegannal! laadu! 
COMPL-go Tomás  POT.visit BASE=1PL.EXCL 


"Tomás came to visit us' 


uyebe yéndabe á- 
gu-e-belH yenda- 
COMPL-g0=3SG.HUM POT.arrive=3SG.HUM a- 


a- 


‘he went to arrive at-’ 

quí gannadia pa ti lúnés la 
qui! g-anna-di=a H palH ti lunes laH 
NEG COMPL-know-NEG-1sc if | one Monday LA 


'I don't know if on a Monday; 


o pa tí dómíngo 
o palHti domingo 


orif one Sunday 


‘or if on a Sunday’ 


086 


087 


088 


089 


090 


091 


092 


o éntre semalHna 


(0) entre semana 
or between week 


“or in the middle of the week’ 
ma zédá maratón 


ma zeelHdaH maraton" 
already FUT.come marathon 


“the marathon would come soon' 


pa  láabé yéndábe raque lunes lá 
pal laa-pel E yenda-belH raquel! lulHnes laH 
if  BASE=3SG.HUM COMPL-3sG.HUM then Monday LA 


“if he came then Monday’ 


o márte 
(0) marHtes 
or Tuesday 


“or Tuesday” 


domíngo que lá 
domi" ngo que!" laH 
Sunday DIST LA 


‘that Sunday’ 
nga ma ra nga maraton 
nga ma'H ra ngal! maraton! 


DEM already LOC NGA marathon 


'that was already when the marathon was' 


domingo que ún veintiséis de abril 
.H LH . . . H d 3H 
domi ngo que" un veintiseis " de abril 


Sunday DIsT a twenty-six of April 


“Sunday, a twenty-sixth of April’ 
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093 veintiséis lá 
veintiseisH laH 
twenty-six LA 


‘twenty-six’ 


094 o veintidós de abríl pue 
o veintidos! de abrill" pues 
or twenty-two of April well 


“or, well, twenty-two of April 


095 yendabe México 
yenda-bel Mexico 
COMPL.arrive-3sc.HUM Mexico 


‘he arrived in Mexico’ 
096 para, jaa, cayuidu diidxa pues 


para jaa ca-ui'-dulH diidxa' pues 
for INTJ PROG-speak=1PL.ExcL word well 


“for, well, us to talk' 


097 como riuuladxibe guébé lá 
como ri-uu-la'dxi'-belH guebel la 
as HAB-enter-liver=3sG.HUM POT-drink-3sG.HUM LA 


H 


“because he likes to drink 


098 para bedandádú Sbado qué 
para bi-edanda!HdulH Sabado quel 
for COMPL-arrive.here=1PL.ExXCL Saturday DIST 


“for us to arrive that Saturday” 


099 bini citárcabe láadú 
bi-ini citar -ca-belH laa-dulH 
COMPL-do make.appointment=PL=3sG.HUM BASE=1PL.EXCL 


“they made the appointment for us” 
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100 


101 


102 


103 


104 


105 


106 


chuudu tí reunión 


chuudul ti reunion! 


POT.go=1PL.EXCL one meeting 


€ . > 
we went to a meeting 


ra jaa Hotél Camino Reál, México 
ra jaa Hotell Camino Real! MeHxico 
LOCINTJ Hotel Camino Real Mexico 


“at the Camino Real Hotel, Mexico City” 


bidii gueela chuudu 


‘night came we went to the Camino Real Hotel 


bidiicabe náati playérá lá 
bi-dii-ca-bel naati playelHra la? 
COMPL-give=PL=3SG.HUM 1sG one shirt LA 
'they gave me a shirt; 

ne  nümero 


nelH nu.Hmero 


and number 


“and a number? 


ne número lá 


LH nulH 


ne mero laH 


and number LA 


“and a number; 


para racá identificár 
para raca identificar 
for HAB-occur identify 


H 


‘to identify [us]’ 


Hotél Camino Real 
bi-dii gueela’ chuu-duH Hotel! Cami" no Real 
COMPL-give night POT.go=1PL.ExcL Hotel Camino Real 
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107 nvumero maizéna 


numero maize! na 


number Maizena 


“number Maizena’ 


108 deti, este, ¿xi módó ngué? 


de ti, este, xil! modo nguelH 
of one INT] what way DEM 
*of a, um, what is that? 
T: 109 ti diidxa' 
ti diidxa' 
one word 
“a word' 
M: 110 ti plática biucabé 
ti platica bi-uu-ca-bel! 


one conversation COMPL-enter=PL=3SG.HUM 


*a conversation' 


111 casiti entrenamiénto 


LH 


casiti entrenamie~ nto 


like one training 


‘like a training session’ 


112 péru ti dxi ante 
pelHruti dxi ante 
but one day before 


‘but one day before’ 


113 viérne huaxhinnique lá 


vie“rne huaxhinni quel ]aH 


Friday evening DEM LA 


‘that Friday evening’ 
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114 


115 


116 


117 


118 


119 


120 


uxudxidú 
gu-xudxi-dul! 
COMPL=drink=1PL.EXCL 


“we got drunk 


laabe lá 
laa-bel.H ]aH 
base=3sG.ANIM LA 


‘him (pointing) 
Vidal lá 


Vidal la! 
Vidal La 


"Vidal 


ne  náa 


nelH naa 


and 1sG 

“and T 

bide'du jma cáguáma 
bi-de'-dulH jma! cagua ma 
COMPL-drink=1PL.ExCL much beer 


“we drank lots of beer” 


hasta ti botélla de bacardí bide'du ráqué 
hasta ti botella de bacardi" bi-de'=dulH raquel! 
even one bottle of Bacardi ComPL-drink=1PL.ExCL then 


“we even drank a bottle of Bacardi’ 


bira guéela sábado qué 
bi-ralH ^ gueela' sabado quel 
coMPL-end night Saturday DIST 


“Saturday at dawn' 
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121 guye cita qué 
gu-e-a' cita que 
COMPL-e-1sc date NEG 


H 


‘I went to the appointment’ 


122 chaa 
chaa 
POT.go=1sG 


LH 


*I go 

123 qui üyédiá dxiiña 
quel gu-e-di=a dxiiña 
NEG  COMPL-go-NEG-1sG work 


‘I didn't even go to work’ 


124 naxudxerua’ 
naxudxerua' 
STAT-drunk-still=1sG 


‘Twas still drunk’ 
125 yendayá 


gu-enda=a’ 
COMPL-arrive=1sG 


H 


“when l arrived” 


126 para bira guéela bicaacabé lá 
parabi-ra!M  gueela' bi-caa=ca=be!# la 
for COMPL-end night COMPL-put=PL=3SG.HUM LA 


H 


“for when the night ended, they called” 


127 ne güdxi que raca uleza stale stálé binni  úye 
nel" gu-dxi que r-aca uleza stale“ stale“ binni" gu-e 
and coMPL-saythatHAB-occurcOMPL-waitmany many people COMPL-go 


'and it was said that many many people were expected to come' 
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128 


139 


130 


131 


132 


133 


134 


stale  stálé 
stale“! stalelH 


many many 
“many many” 


miles 
milHes 
thousands 


“thousands” 


para nuá xa 


LH_,>H 


para n-uu* =a xa 


for sTAT-to.be-1sG.HUM INTJ 


“for me there; 


pues naa lá 
pues naa laH 
well 1sG La 


“well as for me, 


pues uyé normal xa 
pues gu-e=a 
well comPL-go=1sG normal INTJ 


normal! xa 


“well I went, normal’ 


Zaca, dé pantalá mezclilla zacá 
zacal! de pantalon! mezclilla zacal! 
that.way PP pants jean that.way 
‘that way, with jean pants’ 

ira ní gúye que lá 

guira’ H nitH guye quel! laH 

all REL COMPL-gO DIST LA 


“all the ones that went’ 
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135 


136 


137 


138 


139 


140 


141 


162 


puro  depánts laaca xa 
puro de pan'ts laaca xa 
all PP pants also INTJ 


'only in athletic pants' 
ójo, püro  profesionál 


oHjo puro profesional! 
INTJ all professional 


“wow, all professionals” 


puro  depánts ténis jaa 
puro de pants tenis jaa 
all PP pants tennis.shoes INTJ 


“all in athletic pants, tennis shoes, huh' 


ténis qué zínié 
tenis quel zi-nelH=4H 


tennis.shoes NEG FUT-bring=1sG 


T didn't bring tennis shoes' 


pero ziniá pantalón de mezclílla 


pero zi-neH-a'H pantalon! de mezclilla 


but FuT-bring=1sG 


“but I brought jeans' 
óra  biiyá lá 
olHra bi-uuya-a'H — ]aH 


when cOMPL-see-1sG LA 

"when I saw? 

biiya cabinni qué né pants también 
bi-uuya-a'H LH quel nel! pants! tambien 
COMPL-see-1sG PL person DIST and pants also 


ca binni H 


‘I saw the people with pants also’ 
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: 143 


: 144 


: 145 


: 146 


: 147 


: 148 


peru qué gunebia’ya diá laacabé 
H 


peru que gu-nebia'ya'-di-a'H laacabel 


but ComPL-know-NEG=1SG BASE=PL=3.HUM 


“but I didn't know them at all 


má guxudxilw’ jaa 
ma!  gu-xudxi-lu' jaa 


already COMPL-drunk=2SG INTJ 


"You were already drunk huh' 


stale  bínní núu stálé 
stale-H binni-H n-uulH — stalelH 
many person STAT-to.be many 


“Many, many people there” 


ah puessi ¿candá cérvéza  ruaalu ya? 
ah puessi ca-nda' cerveza ruaa-lu  — ya 
INTJ well yes PROG-smell beer mouth-2sc Q 


"Well, yeah, your mouth was smelling like beer huh?” 


stale binni nuu raqué 
stale“! binni! nuulH raquel! 


many person STAT-to.be then 


“There were many people then’ 


stale xhó nuu ruáalu' 
stalel! xho’ n-uulH — ruaalw 
many smell stat-to.be mouth-2sc 


“Many smells in your mouth' 


stale — bádudxaapa 
stale badudxaapa 
many gir 


“Many women' 
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149 stale bádunguiiu 
stale badunguiiu 


many boy 
'many men' 
150 de ira’- 
de guira A 
PP all 
‘of all-’ 
151 aja, de ira clase 
aja de guira’! claHse 
yes PP all kind 
‘of all kinds’ 
152 de ira médida 
de guira’! medida 
PP all sizes 
“of all sizes” 
153 peruara guyuúdú lá 
peru ara gu-uulH=dulH la? 


but now compPL-to.be=1PL.EXCL LA 


“but now we are there, 


154 para bidiicabe láadú lá 
para bidiicabe!# laadu! laĦ 
for COMPL-give=PL=3.HUM LAA=1PL.EXCL LA 


‘for them to give us; 


155 ti número lá 
ti numero la! 
one number LA 


‘a number’ 


164 


156 


157 


158 


159 


160 


161 


162 


ne tí pláyéra 
nelH ti  playelHra 
and one shirt 

“and a shirt’ 

óraque lá 

oH raquelH laH 
now LA 
“then, 


bisigade que lá 
bi-siga’de’ quel! ]aH 
COMPL-give DIST LA 


'that was given' 


óra ti paquété lá 
ora ti paquete laH 


when one package 

"when a package' 

bisiga de  maizéná 1a 

bi-siga'de' maizena aH 
COMPL-give Maizena LA 


‘Maizena was given’ 

ti naga' nda Espráit 
ti naga'nda Esprait" 
one STAT-cold Sprite 


“a Sprite soda' 


bini patrocinár Coca Cóla 
bi-i'ni patrocinar"! Coca Co.Hla 


COMPL-do sponsor Coca Cola 


“Coca Cola sponsored (the event)’ 
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163 Espráit lá 
Esprait™ laH 
Sprite LA 


“Sprite” 


164 chupa  máizéná lá 


LH 


chupalH maizena laH 


two Maizena LA 


“two Maizena’ 


165 ma nguesi zulua’ 
malH ngue-silH 
already DEM- FUT-believe-1sc 


z-ulu-a'H 


‘that’s it, I think’ 


166 peru la 
peru la? 
but LA 


"but 
167 ze ti bólsa ti paquéte pue 


z-e ti bolÉsati paque!te pues 
FUT-go one bag one package well 


“there came a bag, well, a package 


168 ah ne  zeéda gadxe revista raqué 
ah nelHz.eelHda gadxe revista raque 
INTJ and FUT-come different magazine then 


LH 


‘and there came a different magazine’ 


169 revista de corredóré láni 


revista de corredo Hres la H=ni 


magazine Corredores name=3SG.INAN 


'its name is Corredores (Runners) magazine 
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170 


171 


172 


173 


174 


175 


176 


casiti libru pue 
casiti libru pues 
like one book well 


“(It's) like a book’ 


informaci ne ni 
informacion! nel! nilH 


information with 3sG.INAN 


‘with information’ 


ne  zeéda ti jaa ti planu 
nelH z-eelHda ti jaa ti planu 
and FUT-come one INT] one map 


'and there came a map' 


ti plánú la 
ti plalHnu laH 
one map LA 
“a “planu” 

o ti mápa 
o ti mapa 
or one map 


€ “ ” 
ora mapa 


irá ní pue para información 
guira’ H pitt pues para informacion 
all 3SG.INAN well for information 


H 


‘well, all of it for information’ 


para săbado 
para salĦbado 
for Saturday 


‘for Saturday’ 
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177 


178 


179 


180 


181 


182 


183 


168 


domíngo que lá 
domi" ngo que laH 
Sunday DIST LA 


‘on Sunday’ 


domingo siádo que lá 
domingo siaHdo' quel! laH 


Sunday morning DIST LA 


€ . > 
on Sunday morning, 


pue como nayaa nua 


pues como nayaa — n-uulH-4'H 


a 


qué lá, 


q 


uel! ]¿H 


well as STAT-raw STAT-to.be-1sG DIST LA 


‘because I was hungover’ 


cádi nadana? endaro' xa 
calHdi na-da'/na! endaro' xa 
NEG  STAT-temptfood INTJ 


‘food wasn't appetizing’ 


ma candaana gueela’ 
ma'H  ca-ndaana-a'H gueela' 
already proc-be.hungry=1sc night 


T started to be hungry at night' 


udahuá normál 
gu-dahua!’ normal! 
COMPL-eat.1sG normal 


Tate normal (as I normally would)’ 


normal udahua’ 
normal! gu-dahua’# 
normal coMPr-eat.1sc 


‘I ate NORMAL (as I normally would) ' 
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185 


186 


187 


188 


189 


190 


perodomíngo siádo dxi maratón qué lá 


pero domingo siaHdo' dxi maraton! quel! laH 


but Sunday morning day marathon DIST LA 


“but on Sunday morning on the day of the marathon' 


ante de las ocho chuudu pendiente 
ante de las ocho ch-uu-du pendien*te 
before PP the eight POT-g0=1PL.EXCL matter 


“before 8 o'clock we had a place to be' 


bira géela domingo 
biralH ^ gueela' domi" ngo 
coMPL-end night Sunday 


“Sunday at dawn' 


bibané lá 


H H 


bi-bani=a’ la 


COMPL-wake.up=1sG LA 


‘I woke up, 
guzé xa 
gu-zi-a'H xa 


COMPL-shower-1sG INTJ 


‘I showered, 
güé ti jügo  denaránjasi xá 
gü-e-a'H ti jul“ go de nara! nja-si! xa 


COMPL-drink-1sc one juice of orange-only INTJ 


'I drank an orange juice only: 


pero naa lá 
pero naa la 
but 1sG LA 


‘but I} 
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191 


192 


193 


194 


195 


196 


197 


170 


rabé fácilni xá 
r-abiza'H faHcilenilH xa 
HAB-say-1sG easy=3SG.INAN INTJ 


‘I say it was easy’ 


oséa  nagueendani pué 
ose Ha na-gueenda-ni™ pues 
so STAT-fast-3sc.INAN well 


“well, it was fast’ 


qué ñuné” este pensárdiá de que pa 
H 


que! ñ-uni=a’™! este pensar H-di-a'H de que pal 


NEG IRR-do-1sc um think-NEG-1sc PP that if 


‘I didn't even think whether it was far’ 


óra guyuudu qué xa 
olHra guyuudu! quel xa 


when COMPL-go=1PL.EXCL DIST INTJ 


“the time we went’ 


má gundaa las ocho 


ma'H  gu-ndaa las ocho 


already COMPL-be.late the eight 


“already after eight’ 
má cayete — tutiiisí qué lá 
ma'H  ca-ete — tutiil Isi! H quel! laH 


already PROG-fall everyone DIST LA 
“already everyone had fallen’ 
qué uyuti jaa 


queH gu-ati jaa 
NEG COMPL-die INTJ 


*not dead, huh' 


zítüni 
zitu=nilH 


far=3SG.INAN 
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199 


200 


201 


202 


203 


204 


pue casicagunaa que xa 
pues casi ca guaa quel! xa 


well like PL woman DIST INTJ 


“well, like the women; 


pantalón de mezclilla 
pantalon” de mezclilla 

pants of jeans 

‘jean pants’ 

óraque sié lá 
o-HraquelH si=a'H laH 
now COMPL.buy=1sG LA 
‘now I bought’ 

ti shórt 

ti short! 

one short 

‘shorts’ 

sié ti par těnis 
si-a'H ti par tenis 


COMPL.buy-1sc one pair tennis.shoes 


T bought a pair of tennis shoes” 


iruti qué runebia’ya 

gui ruPti'H que? r-unebia'=a 
nobody NEG HAB-know-1sc 
T didn't know anyone' 

stubé” 

stubiH=aH 

alone 


‘I was alone’ 
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205 peruóra  bira guéela lá 
peru ora biralH gueela’ la? 
but when compt-end night rA 


“but at dawn’ 


206 vuélta jügo de naránjanguésí ^ güé pue 
vuelÉtati ^ ju!Hgo de nara Anja nguel H-sjHH gu-e-a'H pues 


again juice of orange DEM- COMPL-drink-1sc well 


'again, well, I drank just an orange juice' 


207 nin quí fiahuadiá de endaré gasti’ 
nin qui fi-ahua-di-a'H de guendaro-a' gasti'H 
not.even NEG IRR-eat/drink-NEG=1sc of food-1sc nothing 


‘I didn't even eat/drink any of my food’ 


208 jügo  quesí güé 
jugo quell sit gu-e-a' 
juice DEM-only cowrr-eat/drink-1sc 


H 


‘I drank only the juice’ 


209 iza má stale  bínní xa 
ri-iza malH stalel! binnilH xa 
HAB-walk already many person INTJ 


‘many people were walking’ 


210 cuzeetecabe lú sonido qué 
cu-zeete-ca-bel H lu sonido que! 


PROG-mention=PL=3SG.HUM PP sound DIST 


‘they mentioned that on the sound system’ 


211 cuzeetecabe ira ni  chúxooñe ca lá 
cu-zeete-ca-belH guira H ni chu-xoofieca la 
cu-zeete=ca=beH all REL POT-run DEM LA 


H 


“they mentioned all those who were going to run' 
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212 


213 


214 


215 


216 


217 


218 


zuhuaacabé en fila lá, 
zuhuaa-ca-belH en fila laH 
FUT.stand-PL-3sG.HUM in line LA 


“they were all standing in line” 


purtí ma las ocho dela mañána chuzulu 
purti'H ma'H las ocho de la mañana chu-zulu 
because already the eight of the morning  Por-begin 


"because it would begin at eight in the morning' 


chi guiaaxha cabinni cá chuxooñeca 
chi gui.P-aaxha ca binni ca chu-xoofie=ca=@ 


POT POT-start PL person DEM POT-run=PL=3 


'the people would begin to run' 


¿pabiá”, ^ este, cayuninacabe lá, 
pabiaH ^ este ca-uni-na-ca-bel! laH 


how.much INTJ PROG-do-say=PL=3SG.HUM LA 
‘How many, um, were they saying’ 
chuxooñe? 


chu-xooñe? 
POT-run 


“would run?” 


unaa ne jaa hombre lá, 
gunaa ne jaa hombre la 
woman and INTJman LA 


“Women and men? 


nacabe cáunaa ca la 
na-ca-belH cagunaa ca la 
Say-PL-3SG.HUM PL Woman DEM LA 


H 


'they say the women; 
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219 ziaaxhaca primér nacabé 


H 


z-iaaxha=ca=@ primer! na-ca-beH 


FUT-start=PL=3 first say=PL=3SG.HUM 


“they started first 


220 tonce ca hómbre ca lá, 


tonce ca hombre ca lal 
then PLman DEM LA 
“so the men; 


221 pues rarirópa 
pues rariro "pa 
well second 


“well, second” 


222 tonce cani ma ca huaxooñe ca lá, 
tonce ca ni! ma'H ca hua-xooñe ca la 
then PLREL already PL PERF-run DEM LA 


“so those who had already run; 


223 chuu  delánte 
chuu delante 
POT-go front 


‘would go in front’ 


224 ni jmá qué huaxooñe que lá, 
nilH jma! que” hua-xoofie que la 
PL REL more NEG PERF-run DIST LA 


‘the majority who had not run? 


225 chuu de atrás 
chuu de atras! 
POT-go PP back 


‘would go in back’ 
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T: 226 


M: 227 


228 


229 


230 


231 


232 


¿xi módo? 
xilH modo? 
what way 


“In what way? 


peru óraque cáyaca colocár ira bínni qué 
peru olHraque!" ca-aca colocar guira’! binni 


but then PROG-occur place all person DIST 


‘But then all the people were placed,” 


nuu dé ira cláse, 
nuul.H de guira’ clase 
DIST  INTJ STAT-to.be of all 


“there were many different types” 


nuu dé ira médida 
nuulH de guira’! medida 
STAT-to.be of all size 


‘they were of all sizes’ 


ira núu huániisi también ya 
guira H nuu —  huaniisi! tambien! ya 
all STAT-to.be older also INTJ 
“all were older also’ 

peru iza stale — bínni 

peru iza stale“ þinni!" 


but Por-to.walk many person 


“but many people went 


zuluasiá quince míl 
z-ulua-si-a'H quince mil 
FUT-think-even-1sc fifteen thousand 


‘I think fifteen thousand’ 


LH quel 


pue 
pue 
INTJ 
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233 quince mil participante parecesi 1a, 
quin! 
fifteen thousand participants seem-only LA 


ce mil! participante parece-si laH 


‘about fifteen thousand only’ 


234 o diesisés míl 
o diesiseis mil" 
or sixteen thousand 


‘or sixteen thousand’ 


T: 235 ¿pero cuarenta-y-dós kilómetro  bixooñelu 14? 
pero cuarenta-y-dos! kilo! metro bi-xoofie-lu' laH? 
but forty-two kilometer  COMPL-run=2SG LA 


“But you ran forty-two kilometers?” 


M: 236 ya, bixooné’ 
ya, bi-xoofie-a'H 


yes coMPL-run-1sG 


“Yes, I ran’ 


237 sti dxi que lá 
sti dxi quel Ja 


other day DIST LA 


‘the next day; 


238 chi guné dxiiña 
chi — gu-.-i'ni-a'H dxiiña 


POT.go POT-do=1SG work 


‘I went to work’ 


239 ti semăna  gutá' 
ti semana gu-ta-a' 


one week COMPL-lay.down-1sc 


H 


‘I laid down one week’ 
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240 


241 


242 


243 


244 


245 


246 


zacá nachonga ñée” 


zacalH ^ nachonga fiee=a’ 


that.way sTAT-stiff leg-1sc 


H 


“my leg was stiff like this” 


cádi chicharrónchonga 
caHdi chicharronH-chonga 
NEG pork.rind-stiff 


“not a stiff pork rind' 


óra  biluxe maratón que lá 
olHra bi-luxe maraton" quel H laH 


when COMPL-end marathon DIST LA 


“when the marathon ended; 


chonna (óra ne cuarénta ytanto minüto  bixooñé ni 


LH oLH LH H :LH 


chonna ra nelH cuarenta- y-tanto minuto bi-xooñe=a ni 


three hour and forty- some minute COMPL-run=1sG 3SG.INAN 


T ran it in three hours forty-some minutes” 


¿tu bíni ganar ni? 
tulH bi-bni ganar! nil E 
who coMPL-do win 3SG.INAN 


^who won it?' 


chupa ora ne  quínce minútó lá, 
chupalH oLHra ne! quill 


two hour and fifteen minute LA 


nce minuto la! 


*two hours and fifteen minutes' 


chupa ora ne  gándé  mínáto zulua’ 


LH oL H H 


chupa ra nel! gande! minuto zulu=a’ 


two hour and twenty minute rur.believe 


‘I think two hours and twenty minutes’ 
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247 badudxaapa que lá 
badudxaapa quel! laH 
woman DIST LA 


“the woman, 


248 mexicána ca lá 
mexicaHna ca laH 


Mexican DEM LA 


“the Mexican [runner]; 


249 laacachupa  órá lá 


LH oL H 


laaca chupa ra laH 


also two hour LA 


*also two hours' 


250 perujma minuto 
peru jmaE minuto 


but more minute 


‘but more minutes’ 


251 casi chonna ora zulua’ 
casi chonna ora zulu=a' 
like three hour rut.believe=1sc 


H 


“almost three hours I think’ 


252 naalá 
naa laH 
ISG LA 


“as for me” 


253 pue casitapa Óra 


pues casi tapa oH ra 


well like four hour 


“well, about four hours” 
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254 


255 


256 


257 


258 


259 


260 


óra  biiyá stale binni zéeda la 
olHra bi-uuya-a'H stale! þinni! z-eeda laH 


when comPL-see=1sG many person FUT-arrive LA 


‘when I saw many people arrive’ 


zeeda badunguiiu badudxaapa 

z-eeda badunguiiu badudxaapa 

FUT-arrive man woman 

“men, women arrived’ 

zaca rúlui biri laaca zeedaca 
zacal!  ru-lui biri laa=ca=® z-eeda=ca=@ 


that.way HAB-show ant BASE=PL=3 FUT-arrive=PL=3 


‘that way they seemed like ants as they were arriving’ 


buénu ti semăna  gutá 
buelHnuti | semalEna guta=a H 
well one week COMPL-lay.down-1sc 


“well, I laid down for one week’ 


que! ganda saa 
que! g-anda=a H saa 


NEG COMPL-be.able=1sG party 


T couldn't go to any parties’ 


guira  dxí cadaabia’ 
guiraH dxi ca-daabi-a'l 
all day PROG-massage-1sc 


H 


‘I massaged myself every day’ 


bidiicabe náati medálla 
bi-dii=ca=bel#! naati medalla 
COMPL-give=PL=3SG.HUM 1sG one medal 


“they gave me a medal 
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261 


262 


263 


264 


265 


266 


267 


180 


ira ní  yénda lu métá lá 
guira’ H nilH yenda lu meta la?! 
all REL arrive.here PP goal LA 


‘everyone that arrived at the finish line’ 


cuacani 


cua=ca=ni-H 


grab=PL=3SG.INAN 


“got one” 
yanna má quí udxela foto qué 
yanna ma qui gu-dxela foto que 


now already NEG COMPL-find-1SG DIST 


“now I can't find the photo' 


qui gápa foto stinné 
quil! g-apa foto stinne-a' 
NEG CowPr-have photo mine 


H 


'I didn't have my picture' 


¿tu lá bini ganár, este, primér lugár? 
tulH laH p-ini ganar"! este primer" lugar! 
who name COMPL-do win INTJ first place 


"Who won, um, first place?' 


ti militár bini ganár dxiqué 
ti militar" bi-ini ganar! dxiquel 


one soldier coMPL-do win then 


*a soldier won then' 


naa quí ñapadiá entrenadór 
H ñ-apa-di=a H 
lsG NEG IRR-have-NEG=1SG trainer 


naa qui entrenador 


‘T didn't have a trainer’ 
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269 


270 


271 


272 


quí ñápá” 
quill ñapa=a' 
NEG IRR-have-1sc 


H 


‘I didn't have’ 


naa stubesia’ 
naa stube-si-a'H 


1sG alone=only=1sG 
“it was just me' 


huati qué 
huati-H que! 
dumb DIST 


“that was dumb' 


nga nga  rúxooñe 


H 


ngaH ngal ru-xooñe 


DEM NGA HAB-run 


‘that’s what it is to run’ 


peru ruxooñe riésgo 
peru ruxooñe riesgo 
but HAB-run risk 


“but to run a risk 
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Information structure in Isthmus 
Zapotec narrative and conversation 


Three main observations motivate this study: 


* the combination of the existing documentation and a relatively large and active 
speaker community offer a unique opportunity to document information structure 
in ZAI and to study the language as it is used by speakers in everyday life; 


as atonal and verb-initial language, the study of ZAIrepresents a chance to explore 
the possible combinations of tone, intonation, morphology and verb-initial syntax 
that may occur in the coding of information structure, and 


the analysis of an endangered language contributes to our theoretical understand- 
ing of information structure and informs our knowledge of language documenta- 
tion practices and revitalization efforts. 


Overall, the analysis demonstrates the value of and need for information structure 
studies to document and analyze spontaneous and naturally-occurring discourse, partic- 
ularly in understudied and endangered languages. 
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